(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

(19) World Intellectual Property Organization
International Bureau

(43) International Publication Date: 29 March 2001 (29.03.2001)
(10) International Publication Number: WO 01/21772 A2

(51) International Patent Classification 7: C12N 9/00

(21) International Application Number: PCT/US00/25993

(22) International Filing Date: 21 September 2000 (21.09.2000)

(25) Filing Language: English

(26) Publication Language: English

(30) Priority Data:
09/400,494    21 September 1999 (21.09.1999)    US
60/210,072    7 June 2000 (07.06.2000)          US
60/221,836    28 July 2000 (28.07.2000)         US
60/227,860    24 August 2000 (24.08.2000)       US

(71) Applicant: OMNIGENE BIOPRODUCTS [US/US]; 763(d) Concord Avenue, Cambridge, MA 02138 (US).

(72) Inventors: YOCUM, Rogers, R.; 4 Orchard Lane, Lexington, MA 02420 (US). PATTERSON, Thomas, A.; 89
Church Street, North Attleboro, MA 02760 (US). HERMANN, Theron; 18 Chilhowie Drive, Kinnelon, NJ 07405
(US). PERO, Janice, G.; 20 Solomon Pierce Road, Lexington, MA 02420 (US).

(74) Agents: HANLEY, Elizabeth, A. et al.; Lahive & Cockfield, LLP, 28 State Street, Boston, MA 02109 (US).

(81) Designated States (national): AU, BR, CA, CN, CZ, HU, ID, IL, IN, JP, KR, LT, LV, MX, NO, PL, RU, SG,
SK, TR, UA, ZA.

(84) Designated States (regional): European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU,
MC, NL, PT, SE).

Published:
- Without international search report and to be republished upon receipt of that report.

For two-letter codes and other abbreviations, refer to the "Guidance Notes on Codes and Abbreviations"
appearing at the beginning of each regular issue of the PCT Gazette.


(54) Title: METHODS AND MICROORGANISMS FOR PRODUCTION OF PANTO-COMPOUNDS

(57) Abstract: The present invention features methods of producing panto-compounds (e.g., pantothenate) using microorganisms
in which the pantothenate biosynthetic pathway and/or the isoleucine-valine biosynthetic pathway and/or the coenzyme A
biosynthetic pathway has been manipulated. Methods featuring ketopantoate reductase overexpressing microorganisms as well as
aspartate α-decarboxylase overexpressing microorganisms are provided. Methods of producing panto-compounds in a
precursor-independent manner and in high yield are described. Recombinant microorganisms, vectors, isolated nucleic acid
molecules, genes and gene products useful in practicing the above methodologies are also provided. The present invention also
features a previously unidentified microbial pantothenate kinase gene, coaX, as well as methods of producing panto-compounds
utilizing microorganisms having modified pantothenate kinase activity. Recombinant microorganisms, vectors, isolated coaX
nucleic acid molecules and purified CoaX proteins are featured. Also featured are methods for identifying pantothenate kinase
modulators utilizing the recombinant microorganisms and/or purified CoaX proteins of the present invention.




METHODS AND MICROORGANISMS FOR 
PRODUCTION OF PANTO-COMPOUNDS 

Background of the Invention 

Pantothenate, also known as pantothenic acid or vitamin B5, is a member of the
B complex of vitamins and is a nutritional requirement for mammals, including
livestock and humans (e.g., from food sources, as a water soluble vitamin supplement or
as a feed additive). In cells, pantothenate is used primarily for the biosynthesis of
coenzyme A (CoA) and acyl carrier protein (ACP). These coenzymes function in the
metabolism of acyl moieties which form thioesters with the sulfhydryl group of the
4'-phosphopantetheine portion of these molecules. These coenzymes are essential in all
cells, participating in over 100 different intermediary reactions in cellular metabolism.

The conventional means of synthesizing pantothenate (in particular, the bioactive
D isomer) is via chemical synthesis from bulk chemicals, a process which is hampered
by excessive substrate cost as well as the requirement for optical resolution of racemic
intermediates (e.g., resolution of DL-pantolactone to obtain D-pantolactone for chemical
condensation with β-alanine). Accordingly, researchers have recently looked to
bacterial or microbial systems that produce enzymes useful in pantothenate biosynthesis
processes (as bacteria are themselves capable of synthesizing pantothenate). In
particular, bioconversion processes have been evaluated as a means of favoring
production of the D isomer of pantothenic acid, e.g., using microorganisms which
selectively hydrolyze a DL-pantothenic acid ester to D-pantothenic acid;
microorganisms which selectively decompose L-pantolactone resulting in
D-pantolactone alone; and microorganisms which selectively hydrolyze DL-pantolactone
to D-pantoic acid.

There is still, however, significant need for improved pantothenate production
processes, in particular, for processes requiring reduced quantities of substrates and/or
less expensive substrates. To this end, methods of direct microbial synthesis have
recently been examined as a means of improving D-pantothenate production. In
microbes, pantothenate biosynthesis is a multistep pathway resulting in condensation of
pantoate (derived from α-ketoisovalerate) and β-alanine to form D-pantothenate. The
isoleucine-valine (ilv) pathway biosynthetic enzymes, acetohydroxyacid synthetase (the
ilvBN or alsS gene product), acetohydroxyacid isomeroreductase (the ilvC gene product)
and dihydroxyacid dehydratase (the ilvD gene product) catalyze the conversion of
pyruvate to α-ketoisovalerate. The reactions are further catalyzed by the pantothenate
(pan) pathway biosynthetic enzymes ketopantoate hydroxymethyltransferase (the panB
gene product), ketopantoate reductase (the panE gene product),
aspartate-α-decarboxylase (the panD gene product) and pantothenate synthetase (the panC gene
product).

The genes encoding the enzymes involved in the biosynthesis of pantothenic acid
in Salmonella typhimurium and Escherichia coli have recently been identified and
characterized (Frodyma and Downs (1998) J. Biol. Chem. 273:5572-5576 and
Jackowski (1996) pp. 687-694, In Neidhardt et al. (ed.) Escherichia coli and Salmonella:
Cellular and Molecular Biology, 2nd ed. Am. Soc. Microbiol. Washington, D.C.). In E. coli, for
example, the biosynthesis of pantothenic acid consists of four key steps. The first
reaction is catalyzed by the panB gene product, ketopantoate hydroxymethyltransferase,
and uses the L-valine intermediate α-ketoisovalerate to generate ketopantoate, which is
subsequently reduced to pantoate by the panE gene product, ketopantoate reductase.
The panD gene product, aspartate-α-decarboxylase, generates β-alanine from aspartate.
The panC gene product, pantothenate synthetase, subsequently ligates β-alanine with
pantoate to yield D-pantothenate.
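
The four steps just described can be written down as a small data structure, which makes it easy to see why a strain lacking aspartate-α-decarboxylase activity needs a β-alanine feed. The following is only an illustrative sketch: the names `Step`, `PAN_PATHWAY` and `reachable` are hypothetical, and cofactors and co-substrates of the real enzymes are deliberately left out.

```python
from collections import namedtuple

# One enzymatic step: gene, enzyme name, required substrates, product.
# Cofactors are omitted; this is a simplified, hypothetical model.
Step = namedtuple("Step", ["gene", "enzyme", "substrates", "product"])

PAN_PATHWAY = [
    Step("panB", "ketopantoate hydroxymethyltransferase",
         ("alpha-ketoisovalerate",), "ketopantoate"),
    Step("panE", "ketopantoate reductase",
         ("ketopantoate",), "pantoate"),
    Step("panD", "aspartate-alpha-decarboxylase",
         ("aspartate",), "beta-alanine"),
    Step("panC", "pantothenate synthetase",
         ("pantoate", "beta-alanine"), "D-pantothenate"),
]


def reachable(start_pool, steps, deleted=()):
    """Return every compound that can form from start_pool.

    A step fires once all of its substrates are available and its
    gene is not listed in `deleted`.
    """
    pool = set(start_pool)
    changed = True
    while changed:
        changed = False
        for step in steps:
            if step.gene in deleted or step.product in pool:
                continue
            if all(s in pool for s in step.substrates):
                pool.add(step.product)
                changed = True
    return pool


if __name__ == "__main__":
    precursors = {"alpha-ketoisovalerate", "aspartate"}
    print(sorted(reachable(precursors, PAN_PATHWAY)))
    # Without panD, pantothenate forms only if beta-alanine is supplied.
    print("D-pantothenate" in reachable(precursors, PAN_PATHWAY, deleted={"panD"}))
```

In this toy model, removing panD blocks pantothenate formation unless β-alanine is added to the starting pool, which mirrors the precursor feeding discussed for the prior-art strains.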

The authors Dusch et al. described the identification of the Corynebacterium
glutamicum panD gene and reported that expression of the C. glutamicum panD gene in
E. coli yielded a strain producing pantothenate with a specific productivity of 140 ng of
pantothenate per mg (dry weight) per hour (Dusch et al. (1999) Appl. Environ.
Microbiol. 65:1530-1539).

The authors Sahm and Eggeling have further identified the Corynebacterium
glutamicum panB and panC genes and have described a genetically engineered strain of
C. glutamicum which overexpresses the panBC genes (Sahm and Eggeling (1999) Appl.
Environ. Microbiol. 65:1973-1979). The engineered strain produces pantothenate;
however, it was necessary to overexpress the genes responsible for α-ketoisovalerate
production in the host organism in order that pantothenic acid production could be
detected. Moreover, without the addition of β-alanine, no substantial amounts of
pantothenate accumulated with the strain constructed.

Likewise, a method of producing D-pantothenic acid has been described that
takes advantage of a sodium salicylate resistant mutant strain of E. coli which produces
D-pantothenic acid when cultured in the presence of β-alanine (U.S. Patent No.
5,518,906). Generation of E. coli strains resistant to α-ketoisovaleric acid and/or
α-ketobutyric acid, and/or α-aminobutyric acid, and/or β-hydroxyaspartic acid and/or
O-methyl-threonine, in addition to salicylic acid, further increased pantothenic acid
production. Moreover, transformation of a plasmid DNA carrying the panB, panC and
panD genes into the salicylic acid resistant mutant strain resulted in increased
pantothenate production; however, up to 20 g/L β-alanine or more was fed in the
examples given. The panB-panC-panD genes are clustered on the E. coli chromosome.

Finally, a method of producing D-pantothenic acid has been described which
utilizes a salicylic acid-resistant, α-ketoisovalerate-resistant, α-ketobutyrate-resistant,
β-hydroxyaspartate-resistant, O-methylthreonine-resistant E. coli strain transformed with
pantothenate biosynthesis gene-containing DNA fragments and/or branched amino acid
biosynthesis gene-containing DNA fragments and cultured in the presence of β-alanine
(U.S. Patent No. 5,932,457).

Pantothenate production in bacteria results from the condensation of pantoate
and β-alanine and involves the pantothenate biosynthetic enzymes ketopantoate
hydroxymethyltransferase (the panB gene product), ketopantoate reductase (the panE
gene product), aspartate-α-decarboxylase (the panD gene product) and pantothenate
synthetase (the panC gene product). Although pantothenate is biologically active as a
vitamin, it is further metabolized in all cells to Coenzyme A (CoA) which participates as
an acyl group carrier in the tricarboxylic acid (TCA) cycle, fatty acid metabolism and
numerous other reactions of intermediary metabolism. The initial (and possibly
rate-controlling) step in the conversion of pantothenate to Coenzyme A (CoA) is
phosphorylation of pantothenate by pantothenate kinase. A pantothenate kinase activity
was first identified in Salmonella typhimurium by screening for temperature-sensitive
mutants which synthesized CoA at permissive temperatures but excreted pantothenate at
non-permissive temperatures. The mutations were mapped in the Salmonella
chromosome and the genetic locus was designated coaA. The gene encodes the enzyme
that catalyzes the first step in the biosynthesis of coenzyme A from pantothenate (Dunn
and Snell (1979) J. Bacteriol. 140:805-808). Escherichia coli temperature sensitive
mutants have also been isolated and characterized (Vallari and Rock (1987) J. Bacteriol.
169:5795-5800). These mutants (named coaA15(Ts)) are defective in the conversion of
pantothenate to CoA and further exhibit a temperature-sensitive growth phenotype,
indicating that pantothenate kinase activity is essential for growth. Moreover, it was
noted that CoA inhibited pantothenate kinase activity to the same degree in the mutant
as compared to the wild-type enzyme.

Feedback resistant E. coli mutants (named coaA16(Fr)) have also been isolated
that possess a pantothenate kinase activity that is refractory to feedback inhibition by
CoA (Vallari and Jackowski (1988) J. Bacteriol. 170:3961-3966). The mutation
responsible for the reversion is, surprisingly, not genetically linked to the coaA gene by
transduction. Additional data described therein support the view that the total cellular
CoA content is controlled by both modulation of biosynthesis at the pantothenate kinase
step and possibly by degradation of CoA to 4'-phosphopantetheine.

The wild-type E. coli coaA gene was cloned by functional complementation of
E. coli temperature-sensitive mutants. The sequence of the wild-type gene was
determined (Song and Jackowski (1992) J. Bacteriol. 174:6411-6417 and Flamm et al.
(1988) Gene (Amst.) 74:555-558). Strains containing multiple copies of the coaA gene
possessed 76-fold higher specific activity of pantothenate kinase; however, there was
only a 2.7-fold increase in the steady state level of CoA (Song and Jackowski, supra). It
has further been reported that the prokaryotic enzyme (encoded by coaA in E. coli and a
variety of other microorganisms) is feedback inhibited by CoA both in vivo and in vitro
with CoA being about five times more potent than acetyl-CoA in inhibiting the enzyme
(Song and Jackowski, supra and Vallari et al., supra). Moreover, it has been reported
that the panB gene product in E. coli is inhibited by CoA (Powers and Snell (1976) J.
Biol. Chem. 251:3786-3793). These data further support the view that feedback
inhibition of pantothenate kinase activity is a critical factor controlling intracellular CoA
concentration.

Using standard search and alignment tools, coaA homologues have been
identified in Haemophilus influenzae, Mycobacterium tuberculosis, Vibrio cholerae,
Streptococcus pyogenes and Bacillus subtilis. By contrast, proteins with significant
similarity could not be identified in eukaryotic cells including Saccharomyces cerevisiae
or in mammalian expressed sequence tag (EST) databases. Using a genetic selection
strategy, a cDNA encoding pantothenate kinase activity has recently been identified
from Aspergillus nidulans (Calder et al. (1999) J. Biol. Chem. 274:2014-2020). The
eukaryotic pantothenate kinase gene (panK) has distinct primary structure and unique
regulatory properties that clearly distinguish it from its prokaryotic counterpart. A
mammalian pantothenate kinase gene (mpanK1α) has also been isolated which encodes
a protein having homology to the A. nidulans PanK protein and to the predicted gene
product of GenBank™ Accession Number 927798 identified in the S. cerevisiae
genome (Rock et al. (2000) J. Biol. Chem. 275:1377-1383).

Summary of the Invention 

The present invention is based, at least in part, on the discovery of key
enzyme-encoding genes of the pantothenate biosynthetic pathway in Bacillus subtilis. In
particular, the present inventors have identified the panE gene of B. subtilis.
Overexpression or deregulation of the panE gene in B. subtilis results in enhanced
production of the panE gene product, ketopantoate reductase, further resulting in
increased production of pantothenate. Likewise, mutations in this gene reduce
pantothenate production in B. subtilis by more than 90%. The present inventors have further
identified the presumptive panBCD operon in B. subtilis, overexpression or
deregulation of which results in increased pantothenate production. The present
inventors have further demonstrated that overexpression or deregulation of the panD
gene in B. subtilis (resulting in enhanced production of the panD gene product,
aspartate-α-decarboxylase) further results in increased production of pantothenate, in
particular, in combination with deregulation of genes encoding key enzymes of the
isoleucine-valine (ilv) biosynthetic pathway.

Accordingly, the present invention features methods of producing pantothenate,
as well as other compounds of the pantothenate biosynthetic pathway (e.g.,
ketopantoate, pantoate and β-alanine), termed "panto-compounds" herein, using
microorganisms in which the pantothenate biosynthetic pathway and/or
isoleucine-valine biosynthetic pathway has been manipulated such that pantothenate or other
desired panto-compounds are produced. In one embodiment, the invention features a
method of producing a panto-compound (e.g., pantothenate or pantoate) that involves
culturing a microorganism which overexpresses the panE gene product, ketopantoate
reductase, also referred to herein as a ketopantoate reductase-overexpressing or
"KPAR-O" microorganism, under conditions such that the panto-compound (e.g.,
pantothenate or pantoate) is produced. In another embodiment, the present invention
features a method of producing panto-compounds (e.g., pantothenate or pantoate)
which includes culturing a microorganism which overexpresses at least one
pantothenate biosynthetic enzyme (e.g., at least one of the panB, panC or panD gene
products), preferably in a KPAR-O microorganism, under conditions such that the
panto-compound (e.g., pantothenate or pantoate) is produced.

Yet another aspect of the invention features methods of producing
panto-compounds which are independent of the need to feed precursors (e.g., β-alanine or
aspartate and/or α-ketoisovalerate or valine). In one embodiment, the invention
features a method of producing pantothenate in a manner independent of precursor feed
that includes culturing an aspartate-α-decarboxylase-overexpressing (AαD-O)
microorganism having a deregulated isoleucine-valine (ilv) pathway under conditions
such that pantothenate is produced. In another embodiment, the invention features a
method of producing pantothenate in a manner independent of precursor feed that
includes culturing an AαD-O microorganism having a deregulated pantothenate (pan)
pathway and a deregulated isoleucine-valine (ilv) pathway, under conditions such that
pantothenate is produced. In another embodiment, the invention features a method of
producing pantothenate in a manner independent of aspartate or β-alanine feed that
includes culturing an AαD-O microorganism under conditions such that pantothenate is
produced. In another embodiment, the invention features a method of producing
pantothenate in a manner independent of valine or α-ketoisovalerate feed that includes
culturing a microorganism having a deregulated isoleucine-valine (ilv) biosynthetic
pathway under conditions such that pantothenate is produced. In yet another
embodiment, the invention features a high yield production method for producing
pantothenate that includes culturing a manipulated microorganism under conditions
such that pantothenate is produced at a significantly high yield (e.g., at a level greater
than 10 g/L, 20 g/L, 30 g/L or 40 g/L).

The methods of the present invention further feature microorganisms that
overexpress acetohydroxyacid synthetase or acetohydroxyacid isomeroreductase (e.g.,
microorganisms transformed with a vector that includes an ilvBNC nucleic acid
sequence), microorganisms that overexpress dihydroxyacid dehydratase (e.g.,
microorganisms transformed with a vector that includes an ilvD nucleic acid sequence),
microorganisms that overexpress aspartate-α-decarboxylase (e.g., microorganisms
transformed with a vector that includes a panD nucleic acid sequence), microorganisms
having a deregulated isoleucine-valine (ilv) biosynthetic pathway and microorganisms
having a deregulated pantothenate biosynthetic pathway (e.g., microorganisms that
overexpress any of ketopantoate hydroxymethyltransferase, ketopantoate reductase,
pantothenate synthetase and aspartate-α-decarboxylase, for example, microorganisms
transformed with a vector comprising a panBCD nucleic acid sequence or a vector
comprising a panE1 nucleic acid sequence). In one embodiment, the recombinant
microorganism is Gram positive (e.g., microorganisms belonging to the genus Bacillus,
Corynebacterium, Lactobacillus, Lactococci or Streptomyces). In another embodiment,
the recombinant microorganism is Gram negative. Particularly preferred is a Bacillus
recombinant microorganism (e.g., Bacillus licheniformis, Bacillus amyloliquefaciens,
Bacillus subtilis, Bacillus pumilus, Bacillus halodurans, and the like). Recombinant
vectors that contain the genes encoding Bacillus pantothenate and/or isoleucine-valine
biosynthetic enzymes (e.g., B. subtilis pantothenate and/or isoleucine-valine biosynthetic
enzymes) are also described.

Also featured are methods of producing β-alanine that include culturing an
aspartate-α-decarboxylase-overexpressing (AαD-O) microorganism under conditions
such that β-alanine is produced and methods of producing β-alanine that involve
contacting a composition comprising aspartate with an isolated Bacillus
aspartate-α-decarboxylase enzyme under conditions such that β-alanine is produced.

The production methods of the present invention further can include recovering
the panto-compound (e.g., pantothenate or pantoate).

The present invention further features recombinant microorganisms (e.g.,
AαD-O microorganisms, microorganisms having a deregulated isoleucine-valine (ilv)
pathway, microorganisms overexpressing at least one of ketopantoate
hydroxymethyltransferase (the panB gene product), pantothenate synthetase (the panC
gene product), aspartate-α-decarboxylase (the panD gene product), ketopantoate
reductase (the panE1 gene product) and microorganisms having a deregulated panBCD
operon). Also featured are panB, panC, panD, panE, ilvB, ilvN, alsS, ilvC, and/or ilvD
nucleic acid molecules, as well as vectors including such nucleic acid molecules and
gene products encoded by such nucleic acid molecules.

The methodology of the present invention further includes, for example in
addition to overexpressing at least one pantothenate biosynthetic enzyme, deleting or
mutating a second pantothenate biosynthetic enzyme, said second pantothenate
biosynthetic enzyme preferably being downstream of the desired product in the
pantothenate biosynthetic pathway. For example, mutating panC, in addition to
overexpressing the panE gene product, results in even further enhanced or increased
production of pantoate. Accordingly, in one embodiment, the invention features a
method of producing pantoate which includes culturing a microorganism which
overexpresses the panE gene product and which has a deletion in the panC gene. In
another embodiment, the invention features a method of producing pantoate which
includes culturing a microorganism which overexpresses the panE gene product and/or
panB gene product and which has a deletion in the panC gene. Other exemplary
embodiments include a method of producing ketopantoate which includes culturing a
microorganism which overexpresses the panB gene product and which has a deletion in
the panE gene and a method of producing β-alanine which includes culturing a
microorganism which overexpresses the panD gene product and which has a deletion in
the panC gene. Also included are methods of producing panto-compounds which
include overexpressing at least one valine biosynthetic enzyme in a microorganism
which has at least one pantothenate biosynthetic enzyme deleted.
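
The pairing of an overexpressed gene with a deleted downstream gene follows mechanically from the pathway topology, and can be sketched as a lookup. This is an illustration only, not a description of how the strains were designed: `PATHWAY` and `design` are hypothetical names, and the model ignores regulation, cofactors and the ilv pathway.

```python
# gene -> (substrates, product); simplified pan pathway, cofactors omitted.
# Hypothetical model for illustration only.
PATHWAY = {
    "panB": (("alpha-ketoisovalerate",), "ketopantoate"),
    "panE": (("ketopantoate",), "pantoate"),
    "panD": (("aspartate",), "beta-alanine"),
    "panC": (("pantoate", "beta-alanine"), "pantothenate"),
}


def design(target):
    """Suggest genes to overexpress (they make the target) and genes to
    delete (they consume the target) so that the target accumulates."""
    overexpress = sorted(
        gene for gene, (_, product) in PATHWAY.items() if product == target
    )
    delete = sorted(
        gene for gene, (substrates, _) in PATHWAY.items() if target in substrates
    )
    return {"overexpress": overexpress, "delete": delete}


if __name__ == "__main__":
    for compound in ("ketopantoate", "pantoate", "beta-alanine", "pantothenate"):
        print(compound, design(compound))
```

For pantoate, ketopantoate and β-alanine, the lookup returns the same gene pairs as the exemplary embodiments above; for pantothenate itself it suggests no deletion, since no step of this model consumes it.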

The present invention is also based, at least in part, on the identification and
characterization of a previously unidentified microbial pantothenate kinase gene, coaX.
CoaX was first identified in Bacillus subtilis and corresponds to an open reading frame
in a portion of the chromosomal DNA that includes the 5' end of the ftsH gene, and all
of the yacB, yacC, yacD, cysK and pabB genes. The present inventors have
demonstrated that the yacB open reading frame encodes a novel pantothenate kinase
activity, the gene being unrelated by homology to any previously known pantothenate
kinase gene. The gene has been renamed coaX, as it encodes the enzyme which
catalyzes the first step in the pathway from pantothenate to CoA.

Accordingly, the present invention features new and improved methods of
producing pantothenate and other key compounds of the pantothenate biosynthetic
pathway (e.g., panto-compounds) utilizing microorganisms having modified
pantothenate kinase activity. In particular, the present invention features recombinant
microorganisms that contain the coaX gene or that contain a mutant coaX gene, having
reduced pantothenate kinase activity. In one embodiment, the invention features such
recombinant microorganisms further having a deregulated pantothenate biosynthetic
pathway. In another embodiment, the invention features such recombinant
microorganisms further having a deregulated isoleucine-valine (ilv) pathway. In a
preferred embodiment, the microorganisms belong to the genus Bacillus (e.g., B.
subtilis).

The present invention also features recombinant microorganisms (e.g.,
microorganisms belonging to the genus Bacillus, for example, B. subtilis) that contain
the coaA gene or that contain a mutant coaA gene, optionally including a coaX gene or
mutant thereof, having reduced pantothenate kinase activity. In one embodiment, the
invention features such recombinant microorganisms further having a deregulated
pantothenate biosynthetic pathway or having a deregulated isoleucine-valine (ilv)
pathway.

Also featured are vectors that contain isolated coaX or coaA genes as well as
mutant coaX and/or coaA genes. Isolated nucleic acid molecules that contain isolated
coaX genes or mutant coaX genes are featured in addition to isolated CoaX proteins and
mutant CoaX proteins.

The nucleic acids, vectors and recombinant microorganisms described above are
particularly useful in the methodologies of the present invention. In particular, the
invention features methods of enhancing panto-compound production (e.g.,
ketopantoate, pantoate and/or pantothenate production) that include culturing a
recombinant microorganism having a mutant coaX gene under conditions such that
panto-compound production is enhanced. In one embodiment, the recombinant
microorganism further includes a mutant coaA gene. In another embodiment, the
recombinant microorganism further includes a mutant avtA and/or mutant ilvE gene
and/or mutant ansB gene and/or mutant alsD gene. Also featured are methods for
identifying pantothenate modulators utilizing the recombinant microorganisms and
purified CoaX proteins of the present invention.

Other features and advantages of the invention will be apparent from the
following detailed description and claims.

Brief Description of the Drawings

Figure 1 is a schematic representation of the pantothenate biosynthetic pathway.

Figure 2 is a schematic representation of the plasmid pAN240, containing
sequences ligated upstream of the P26 panBCD cassette, equivalent to the integrated
version in strain PA221.

Figure 3A is a schematic representation of the plasmid pAN004, containing the
panBCD operon expressed from P26 and RBS1.

Figure 3B is a schematic representation of the plasmid pAN006, containing the
panBCD operon expressed from P26 and RBS2.

Figure 4 is a schematic representation of the plasmid pAN236, containing an
integratable and amplifiable P26-RBS2-panE1 expression cassette.

Figure 5 is a schematic representation of the construction of plasmid pAN423.

Figure 6 is a schematic representation of the construction of plasmids pAN426
and pAN427.

Figure 7 is a schematic representation of the construction of plasmids pAN428
and pAN429.

Figure 8 is a schematic representation of the construction of plasmid pAN431.

Figure 9 is a schematic representation of the construction of plasmid pAN441.

Figure 10 is a schematic representation of the construction of plasmid pAN440.

Figure 11 is a schematic representation of the plasmid pAN251 designed to
integrate a single copy of a P26-panE1 cassette at the panE1 locus by double crossover.

Figure 12 is a schematic representation of the plasmid pAN267 designed to
integrate a single copy of a P26-ilvBNC cassette at the amyE locus.

Figure 13 is a schematic representation of the plasmid pAN257, a clone of
Bacillus subtilis ilvD in a low copy vector.

Figure 14 is a schematic representation of the plasmid pAN263, designed to
integrate a single copy of a P26-ilvD cassette at the ilvD locus.

Figure 15 is a schematic representation of the plasmid pAN261, designed to
disrupt the Bacillus subtilis ilvD gene with the cat gene.

Figure 16 is a schematic representation of the Coenzyme A biosynthetic pathway
in E. coli.

Figure 17 is a schematic representation of the structure of pAN296, a plasmid
designed to delete most of the B. subtilis coaA gene and substitute a chloramphenicol
resistance gene.

Figure 18 is a schematic representation of the structure of the Bacillus subtilis
genome in the region of the coaA gene. The scale is in base pairs and the significant
open reading frames are shown by open arrows.




Figure 19 is a schematic representation of the plasmid pAN281, a plasmid for 
expressing Bacillus subtilis coaA after integration at the bpr locus. 

Figure 20A-B depicts a multiple sequence alignment (MSA) of the amino acid
sequences encoded by six known or predicted microbial coaA genes. SEQ ID NOs:4-6
and 1-3 correspond to the amino acid sequences of Mycobacterium leprae (SwissProt™
Accession No. Q9X795), Mycobacterium tuberculosis (SwissProt™ Accession No.
O53440), Streptomyces coelicolor (SwissProt™ Accession No. O86799), Haemophilus
influenzae (SwissProt™ Accession No. P44793), Escherichia coli (SwissProt™
Accession No. P15044) and Bacillus subtilis (SwissProt™ Accession No. P54556),
respectively. The alignment was generated using ClustalW MSA software at the
GenomeNet CLUSTALW Server at the Institute for Chemical Research, Kyoto
University. The following parameters were used: Pairwise Alignment, K-tuple (word)
size = 1, Window size = 5, Gap Penalty = 3, Number of Top Diagonals = 5, Scoring
Method = Percent; Multiple Alignment, Gap Open Penalty = 10, Gap Extension Penalty
= 0.0, Weight Transition = No, Hydrophilic residues = Gly, Pro, Ser, Asn, Asp, Gln,
Glu, Arg and Lys, Hydrophobic Gaps = Yes; and Scoring Matrix = BLOSUM.
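
For anyone wishing to reproduce the alignment, it may help to record the stated settings in a structured form. The sketch below simply transcribes the parameters listed above into a Python dictionary; the key names are hypothetical labels chosen for readability and are not actual ClustalW command-line options.

```python
# Alignment settings transcribed from the figure description.
# Key names are illustrative labels, NOT real ClustalW option names.
ALIGNMENT_SETTINGS = {
    "pairwise": {
        "ktuple": 1,
        "window_size": 5,
        "gap_penalty": 3,
        "top_diagonals": 5,
        "scoring_method": "percent",
    },
    "multiple": {
        "gap_open_penalty": 10.0,
        "gap_extension_penalty": 0.0,
        "weight_transition": False,
        "hydrophilic_residues": [
            "Gly", "Pro", "Ser", "Asn", "Asp", "Gln", "Glu", "Arg", "Lys",
        ],
        "hydrophobic_gaps": True,
    },
    "scoring_matrix": "BLOSUM",
}

# Standard three-letter to one-letter amino acid codes for the residues used.
THREE_TO_ONE = {
    "Gly": "G", "Pro": "P", "Ser": "S", "Asn": "N", "Asp": "D",
    "Gln": "Q", "Glu": "E", "Arg": "R", "Lys": "K",
}


def hydrophilic_one_letter(settings):
    """Return the hydrophilic residue list as a one-letter string."""
    residues = settings["multiple"]["hydrophilic_residues"]
    return "".join(THREE_TO_ONE[r] for r in residues)


if __name__ == "__main__":
    print(hydrophilic_one_letter(ALIGNMENT_SETTINGS))
```

Alignment tools usually expect the hydrophilic residue list in one-letter form, which is why the conversion helper is included.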

Figure 21 is a schematic representation of the structure of the Bacillus subtilis
genome in the region of the coaX (yacB) gene. The scale is in base pairs, the significant
open reading frames are shown by open arrows and certain predicted restriction
fragments are indicated by thick bars.

Figure 22 is a schematic representation of the structure of pAN341 and pAN342,
two independent PCR-derived clones of B. subtilis yacB (renamed herein as coaX).

Figure 23A-D depicts a multiple sequence alignment (MSA) of the amino acid 
sequences encoded by fourteen known or predicted microbial coaX genes. SEQ ID 
25 NOs:9, 74, 7-8, 75, 1 1, 10 and 12-18 correspond to the amino acid sequences of Bacillus 
subtilis (SwissProt™ Accession No. P37564), Clostridium acetobulyticum (WIT™ 
Accession No. RCA03301, Argonne National Laboratories), Streptomyces coelicolor 
(PIR™ Accession No. T36391), Mycobacterium tuberculosis (SwissProt™ Accession 
No. 006282), Rhodobacter capsulatus (WIT™ Accession No. RRC02473), 
30 Desulfovibrio vulgaris (DBJ™ Accession No. BAA2 1 476. 1 ), Deinococcus radiodurans 
(SwissProt™ Accession No. Q9RX54), Thermotoga maritima (GenBank™ Accession 
No. AAD35964.1), Treponema pallidum (SwissProt™ Accession No. 083446), 
Borrelia burgdorferi (SwissProt™ Accession No.051477), Aquifex aeolicus 
(SwissProt™ Accession No. 067753), Synechocystis sp. (SwissProt™ Accession No. 
35 P74045), Helicobacter pylori (SwissProt™ Accession No. 025533), and Bordetella 
pertussis (SwissProt™ Accession No. Q45338), respectively. The alignment was 
generated using ClustalW MSA software at the GenomeNet CLUSTALW Server at the 
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Institute for Chemical Research, Kyoto University. The following parameters were 
used: Pairwise Alignment, K-tuple (word) size = 1, Window size = 5, Gap Penalty = 3, 
Number of Top Diagonals = 5, Scoring Method = Percent; Multiple Alignment, Gap 
Open Penalty = 10, Gap Extension Penalty = 0.0, Weight Transition = No, Hydrophilic 
residues = Gly, Pro, Ser, Asn, Asp, Gln, Glu, Arg and Lys, Hydrophobic Gaps = Yes;
and Scoring Matrix = BLOSUM. 

Figure 24 depicts a multiple sequence alignment of a portion of the protein 
sequences of the coaA gene products from the following microorganisms: Bacillus 
subtilis, Escherichia coli, Haemophilus influenzae, Mycobacterium leprae,
Mycobacterium tuberculosis, and Streptomyces coelicolor. The residues that are
mutated in E. coli coaA15(Ts) and B. subtilis coaA282A are indicated below and above
the alignment, respectively. The portions correspond to amino acid residues 168-187 of
SEQ ID NO:3, 167-186 of SEQ ID NO:2, 165-184 of SEQ ID NO:1, 169-188 of SEQ
ID NO:4, 169-188 of SEQ ID NO:5 and 179-198 of SEQ ID NO:6, respectively.

Figure 25 is a schematic representation of the structure of pAN294, a plasmid
for integrating mutagenized B. subtilis coaA at its native locus.

Figure 26 is a schematic representation of the structure of pAN336, a plasmid 
designed to delete B. subtilis coaX from its chromosomal locus and replace it with a
kanamycin resistance gene.

Detailed Description of the Invention 

The present invention features new and improved methods of producing 
pantothenate and other key compounds of the pantothenate biosynthetic pathway 
(referred to herein as "panto-compounds", for example, pantothenate, ketopantoate, 
pantoate and β-alanine) using microorganisms in which the pantothenate biosynthetic
pathway has been manipulated such that pantothenate or other desired panto-compounds 
are produced. 

The new and improved methodologies of the present invention include methods 
of producing panto-compounds (e.g., pantothenate) in microorganisms having at least
one enzyme of the pantothenate biosynthetic pathway manipulated such that
pantothenate or other desired panto-compounds are produced (e.g., produced at an
increased level). For example, the invention features methods of producing panto-
compounds (e.g., pantothenate) in microorganisms having at least one of ketopantoate
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase or aspartate-
α-decarboxylase manipulated such that pantothenate or other desired panto-compounds
are produced. The methodologies of the present invention also include methods of
producing panto-compounds (e.g., pantothenate) in microorganisms having at least one
valine-isoleucine biosynthetic enzyme, described herein, manipulated such that
pantothenate or other desired panto-compounds are produced. For example, the
invention features methods of producing panto-compounds (e.g., pantothenate) in
microorganisms having at least one of acetohydroxyacid synthetase, acetohydroxyacid
isomeroreductase or dihydroxyacid dehydratase manipulated such that pantothenate or
other desired panto-compounds are produced.

The invention also features methods of producing panto-compounds that involve 
culturing a ketopantoate reductase-overexpressing (KPAR-O) microorganism under 
conditions such that the panto-compound is produced. The invention also features
methods of producing pantothenate in a manner independent of precursor feed that
involve culturing an aspartate-α-decarboxylase-overexpressing (AαD-O) microorganism
under conditions such that pantothenate is produced. Also featured are β-alanine
independent high yield pantothenate production methods as well as methods of
producing β-alanine. The present invention also features methods for enhancing
production of panto-compounds that involve culturing pantothenate kinase mutants. In
particular, the present invention features new and improved methods of producing
pantothenate and other key compounds of the pantothenate biosynthetic pathway (e.g.,
panto-compounds) utilizing microorganisms having modified pantothenate kinase
activity, for example, microorganisms that include the coaX gene or that include a
mutant coaX gene, having reduced pantothenate kinase activity.

In order that the present invention may be more readily understood, certain terms 
are first defined herein. 

The term "pantothenate biosynthetic pathway" includes the biosynthetic pathway 
involving pantothenate biosynthetic enzymes (e.g., polypeptides encoded by
biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates,
intermediates or products), cofactors and the like utilized in the formation or synthesis
of pantothenate. The term "pantothenate biosynthetic pathway" includes the
biosynthetic pathway leading to the synthesis of pantothenate in a microorganism (e.g.,
in vivo) as well as the biosynthetic pathway leading to the synthesis of pantothenate in
vitro. Figure 1 includes a schematic representation of the pantothenate biosynthetic
pathway. Pantothenate biosynthetic enzymes are depicted in bold and their 
corresponding genes indicated in italics. 

The term "pantothenate biosynthetic enzyme" includes any enzyme utilized in 
the formation of a compound (e.g., intermediate or product) of the pantothenate
biosynthetic pathway. According to Figure 1, synthesis of pantoate from α-
ketoisovalerate (α-KIV) proceeds via the intermediate, ketopantoate. Formation of
ketopantoate is catalyzed by the pantothenate biosynthetic enzyme ketopantoate
hydroxymethyltransferase (the panB gene product). Formation of pantoate is catalyzed
by the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene
product). Synthesis of β-alanine from aspartate is catalyzed by the pantothenate
biosynthetic enzyme aspartate-α-decarboxylase (the panD gene product). Formation of
pantothenate from pantoate and β-alanine (e.g., condensation) is catalyzed by the
pantothenate biosynthetic enzyme pantothenate synthetase (the panC gene product).

The term "isoleucine-valine biosynthetic pathway" includes the biosynthetic 
pathway involving isoleucine-valine biosynthetic enzymes (e.g., polypeptides encoded
by biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates,
intermediates or products), cofactors and the like utilized in the conversion of
pyruvate to valine or isoleucine. The term "isoleucine-valine
biosynthetic pathway" includes the biosynthetic pathway leading to the synthesis of
valine or isoleucine in a microorganism (e.g., in vivo) as well as the biosynthetic
pathway leading to the synthesis of valine or isoleucine in vitro. Figure 1 includes a
schematic representation of the isoleucine-valine biosynthetic pathway. Isoleucine-
valine biosynthetic enzymes are depicted in bold italics and their corresponding genes
indicated in italics.

The term "isoleucine-valine biosynthetic enzyme" includes any enzyme utilized 
in the formation of a compound (e.g., intermediate or product) of the isoleucine-valine
biosynthetic pathway. According to Figure 1, synthesis of valine from pyruvate
proceeds via the intermediates, acetolactate, α,β-dihydroxyisovalerate (α,β-DHIV) and
α-ketoisovalerate (α-KIV). Formation of acetolactate from pyruvate is catalyzed by the
isoleucine-valine biosynthetic enzyme acetohydroxyacid synthetase (the ilvBN gene
product, or alternatively, the alsS gene product). Formation of α,β-DHIV from
acetolactate is catalyzed by the isoleucine-valine biosynthetic enzyme
acetohydroxyacid isomeroreductase (the ilvC gene product). Synthesis of α-KIV from
α,β-DHIV is catalyzed by the isoleucine-valine biosynthetic enzyme dihydroxyacid
dehydratase (the ilvD gene product). Moreover, valine and isoleucine can be
interconverted by branched chain amino acid transaminases.

As used herein, each of ketopantoate, pantoate, β-alanine and pantothenate is a
"panto-compound". The term "panto-compound" includes a compound (e.g., a
substrate, intermediate or product) in the pantothenate biosynthetic pathway which is
downstream from a particular pantothenate biosynthetic enzyme. In one example, a
panto-compound is downstream of the pantothenate biosynthetic enzyme ketopantoate
hydroxymethyltransferase (the panB gene product) and can include ketopantoate,
pantoate and/or pantothenate. In another example, a panto-compound is downstream of
the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene product)
and can include pantoate and/or pantothenate. In yet another example, a panto-
compound is downstream of the pantothenate biosynthetic enzyme pantothenate
synthetase (the panC gene product) and can include pantothenate. In yet another 
example, a panto-compound is downstream of the pantothenate biosynthetic enzyme 
aspartate-α-decarboxylase (the panD gene product) and can include β-alanine and/or
pantothenate. 
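
By way of illustration only, the "downstream" relationships recited above can be sketched as a small reachability computation over the steps described for Figure 1. The following Python sketch is not part of the disclosed methods. The names REACTIONS and downstream_compounds are hypothetical and introduced here for illustration, and the reaction list assumes only the four steps named in the text (the panB, panE, panD and panC gene products).

```python
from collections import defaultdict

# Hypothetical encoding of the steps described for Figure 1:
# gene -> (list of substrates, product).
REACTIONS = {
    "panB": (["alpha-ketoisovalerate"], "ketopantoate"),
    "panE": (["ketopantoate"], "pantoate"),
    "panD": (["aspartate"], "beta-alanine"),
    "panC": (["pantoate", "beta-alanine"], "pantothenate"),
}


def downstream_compounds(gene):
    """Return the compounds at or below the product of the given gene."""
    # Map each substrate to the products that are formed from it.
    formed_from = defaultdict(set)
    for substrates, product in REACTIONS.values():
        for substrate in substrates:
            formed_from[substrate].add(product)

    # Walk forward from the product of the chosen enzyme.
    seen = set()
    stack = [REACTIONS[gene][1]]
    while stack:
        compound = stack.pop()
        if compound not in seen:
            seen.add(compound)
            stack.extend(formed_from[compound])
    return seen


for gene in ("panB", "panE", "panC", "panD"):
    print(gene, sorted(downstream_compounds(gene)))
```

Under these assumptions, the sets returned for panB, panE, panC and panD agree with the four examples given above.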

Preferred panto-compounds include pantothenate and pantoate. The term
"pantothenate" includes the free acid form of pantothenate, also referred to as
"pantothenic acid" as well as any salt thereof (e.g., derived by replacing the acidic
hydrogen of pantothenate or pantothenic acid with a cation, for example, calcium,
sodium, potassium, ammonium), also referred to as a "pantothenate salt". The term
"panto-compound" also includes alcohol derivatives of pantothenate. Preferred
pantothenate salts are calcium pantothenate or sodium pantothenate. A preferred
alcohol derivative is pantothenol. Pantothenate salts and/or alcohols of the present
invention include salts and/or alcohols prepared via conventional methods from the free
acids described herein. In another embodiment, calcium pantothenate is synthesized 
directly by a microorganism of the present invention. A pantothenate salt of the present 
invention can likewise be converted to a free acid form of pantothenate or pantothenic 
acid by conventional methodology. 

The term "pantoate" includes the free acid form of pantoate, also referred to as
"pantoic acid" as well as any salt thereof (e.g., derived by replacing the acidic hydrogen
of pantoate or pantoic acid with a cation, for example, calcium, sodium, potassium,
ammonium), also referred to as a "pantoate salt". Preferred pantoate salts are calcium
pantoate or sodium pantoate. Pantoate salts of the present invention include salts
prepared via conventional methods from the free acids described herein. A pantoate salt
of the present invention can likewise be converted to a free acid form of pantoate or 
pantoic acid by conventional methodology. Moreover, a free acid form of pantoate or 
pantoic acid can be converted to pantolactone by conventional methodology. 

The term "CoA biosynthetic pathway" includes the biosynthetic pathway
involving CoA biosynthetic enzymes (e.g., polypeptides encoded by biosynthetic
enzyme-encoding genes), compounds (e.g., precursors, substrates, intermediates or
products), cofactors and the like utilized in the formation or synthesis of CoA from
pantothenate. A schematic representation of the CoA biosynthetic pathway in E. coli is
set forth as Figure 16. (The pathway depicted is also presumed to be that utilized by
other microorganisms.) The term "CoA biosynthetic pathway" includes the biosynthetic
pathway leading to the synthesis of CoA in microorganisms (e.g., in vivo) as well as the
biosynthetic pathway leading to the synthesis of CoA in vitro. The term "Coenzyme A
or CoA biosynthetic enzyme" includes any enzyme utilized in the formation of a
compound (e.g., intermediate or product) of the CoA biosynthetic pathway, for example,
the coaA, panK or coaX gene product which catalyzes the phosphorylation of
pantothenate to form 4'-phosphopantothenate, or the coaD gene product which catalyzes
the conversion of 4'-phosphopantetheine to dephosphocoenzyme A.

I. Recombinant Microorganisms and Methods for Culturing
Microorganisms Such That Panto-Compounds are Produced

The methodologies of the present invention feature microorganisms, e.g.,
recombinant microorganisms, preferably including vectors or genes (e.g., wild-type
and/or mutated genes) as described herein and/or cultured in a manner which results in
the production of a desired product (e.g., a panto-compound or panto-compounds). The
term "recombinant" microorganism includes a microorganism (e.g., bacteria, yeast cell,
fungal cell, etc.) which has been genetically altered, modified or engineered (e.g.,
genetically engineered) such that it exhibits an altered, modified or different genotype
and/or phenotype (e.g., when the genetic modification affects coding nucleic acid
sequences of the microorganism) as compared to the naturally-occurring microorganism
from which it was derived. Preferably, a "recombinant" microorganism of the present
invention has been genetically engineered such that it overexpresses at least one
bacterial gene or gene product (e.g., a pantothenate or isoleucine-valine biosynthetic
enzyme-encoding gene) as described herein, preferably a biosynthetic enzyme-encoding
gene included within a recombinant vector as described herein and/or a biosynthetic
enzyme expressed from a recombinant vector. The ordinarily skilled artisan will appreciate that
a microorganism expressing or overexpressing a gene product produces or overproduces
the gene product as a result of expression or overexpression of nucleic acid sequences
and/or genes encoding the gene product.

The term "manipulated microorganism" includes a microorganism that has been 
engineered (e.g., genetically engineered) or modified such that the microorganism has at 
least one enzyme of the pantothenate biosynthetic pathway and/or at least one enzyme of
the isoleucine-valine biosynthetic pathway modified such that pantothenate or other
desired panto-compounds are produced. Modification or engineering of such
microorganisms can be according to any methodology described herein including, but
not limited to, deregulation of a biosynthetic pathway and/or overexpression of at least
one biosynthetic enzyme. A "manipulated" enzyme (e.g., a "manipulated" biosynthetic
enzyme) includes an enzyme, the expression or production of which has been altered or
modified such that at least one upstream or downstream precursor, substrate or product
of the enzyme is altered or modified, for example, as compared to a corresponding wild-
type or naturally occurring enzyme.

The term "overexpressed" or "overexpression" includes expression of a gene 
product (e.g., a pantothenate biosynthetic enzyme or isoleucine-valine biosynthetic
enzyme) at a level greater than that expressed prior to manipulation of the
microorganism or in a comparable microorganism which has not been manipulated. In
one embodiment, the microorganism can be genetically manipulated (e.g., genetically
engineered) to overexpress a level of gene product greater than that expressed prior to
manipulation of the microorganism or in a comparable microorganism which has not
been manipulated. Genetic manipulation can include, but is not limited to, altering or
modifying regulatory sequences or sites associated with expression of a particular gene
(e.g., by adding strong promoters, inducible promoters or multiple promoters or by
removing regulatory sequences such that expression is constitutive), modifying the
chromosomal location of a particular gene, altering nucleic acid sequences adjacent to a
particular gene such as a ribosome binding site or transcription terminator, increasing
the copy number of a particular gene, modifying proteins (e.g., regulatory proteins,
suppressors, enhancers, transcriptional activators and the like) involved in transcription
of a particular gene and/or translation of a particular gene product, or any other
conventional means of deregulating expression of a particular gene routine in the art
(including but not limited to use of antisense nucleic acid molecules, for example, to
block expression of repressor proteins).

In another embodiment, the microorganism can be physically or environmentally 
manipulated to overexpress a level of gene product greater than that expressed prior to 
manipulation of the microorganism or in a comparable microorganism which has not
been manipulated. For example, a microorganism can be treated with or cultured in the
presence of an agent known or suspected to increase transcription of a particular gene
and/or translation of a particular gene product such that transcription and/or translation
are enhanced or increased. Alternatively, a microorganism can be cultured at a
temperature selected to increase transcription of a particular gene and/or translation of a
particular gene product such that transcription and/or translation are enhanced or
increased. 

The term "deregulated" or "deregulation" includes the alteration or modification 
of at least one gene in a microorganism that encodes an enzyme in a biosynthetic 
pathway, such that the level or activity of the biosynthetic enzyme in the microorganism
is altered or modified. Preferably, at least one gene that encodes an enzyme in a
biosynthetic pathway is altered or modified such that the gene product is enhanced or
increased. The phrase "deregulated pathway" can also include a biosynthetic pathway in
which more than one gene that encodes an enzyme in a biosynthetic pathway is altered
or modified such that the level or activity of more than one biosynthetic enzyme is
altered or modified. The ability to "deregulate" a pathway (e.g., to simultaneously
deregulate more than one gene in a given biosynthetic pathway) in a microorganism
arises from the particular phenomenon of microorganisms in which more than one
enzyme (e.g., two or three biosynthetic enzymes) is encoded by genes occurring
adjacent to one another on a contiguous piece of genetic material termed an "operon". 

The term "operon" includes a coordinated unit of gene expression that contains a 
promoter and possibly a regulatory element associated with one or more, preferably at
least two, structural genes (e.g., genes encoding enzymes, for example, biosynthetic
enzymes). Expression of the structural genes can be coordinately regulated, for
example, by regulatory proteins binding to the regulatory element or by anti-termination
of transcription. The structural genes can be transcribed to give a single mRNA that
encodes all of the structural proteins. Due to the coordinated regulation of genes
included in an operon, alteration or modification of the single promoter and/or
regulatory element can result in alteration or modification of each gene product encoded
by the operon. Alteration or modification of the regulatory element can include, but is
not limited to, removing the endogenous promoter and/or regulatory element(s), adding
strong promoters, inducible promoters or multiple promoters or removing regulatory
sequences such that expression of the gene products is modified, modifying the
chromosomal location of the operon, altering nucleic acid sequences adjacent to the
operon or within the operon such as a ribosome binding site, increasing the copy number
of the operon, modifying proteins (e.g., regulatory proteins, suppressors, enhancers,
transcriptional activators and the like) involved in transcription of the operon and/or
translation of the gene products of the operon, or any other conventional means of
deregulating expression of genes routine in the art (including but not limited to use of
antisense nucleic acid molecules, for example, to block expression of repressor
proteins). Deregulation can also involve altering the coding region of one or more genes
to yield, for example, an enzyme that is feedback resistant or has a higher or lower
specific activity.

A particularly preferred "recombinant" microorganism of the present invention 
has been genetically engineered to overexpress a bacterially-derived gene or gene 
product. The term "bacterially-derived" or "derived-from", for example bacteria, 
includes a gene which is naturally found in bacteria or a gene product (e.g., ketopantoate
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase, aspartate-α-
decarboxylase, acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase or
dihydroxyacid dehydratase) which is encoded by a bacterial gene (e.g., encoded by
panB, panE, panC, panD, ilvB, ilvN, alsS, ilvC, or ilvD).

The methodologies of the present invention feature recombinant microorganisms 
which overexpress at least one of ketopantoate hydroxymethyltransferase, ketopantoate
reductase, pantothenate synthetase or aspartate-α-decarboxylase. A particularly
preferred recombinant microorganism of the present invention has been genetically
engineered to overexpress a Bacillus (e.g., Bacillus licheniformis, Bacillus
amyloliquefaciens, Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus, etc.)
biosynthetic enzyme (e.g., has been engineered to overexpress at least one of B. subtilis
ketopantoate reductase (the panE gene product) (e.g., ketopantoate reductase having the
amino acid sequence of SEQ ID NO:30 or encoded by the nucleic acid sequence of SEQ
ID NO:29), B. subtilis ketopantoate hydroxymethyltransferase (the panB gene product)
(e.g., ketopantoate hydroxymethyltransferase having the amino acid sequence of SEQ
ID NO:24 or encoded by a nucleic acid molecule having the nucleotide sequence of
SEQ ID NO:23), B. subtilis pantothenate synthetase (the panC gene product) (e.g.,
pantothenate synthetase having the amino acid sequence of SEQ ID NO:26 or encoded
by a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:25) and/or B.
subtilis aspartate-α-decarboxylase (the panD gene product) (e.g., aspartate-α-
decarboxylase having the amino acid sequence of SEQ ID NO:28 or encoded by a
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:27)).

In an exemplary embodiment, the invention features a microorganism (e.g., a 
KPAR-O microorganism) that has been transformed with a vector comprising a panE
nucleic acid sequence (e.g., a panE nucleic acid sequence as set forth in SEQ ID
NO:29). In another embodiment, the invention features a microorganism that has been
transformed with a vector comprising a panB nucleic acid sequence (e.g., a panB nucleic
acid sequence as set forth in SEQ ID NO:23), a vector comprising a panC nucleic acid
sequence (e.g., a panC nucleic acid sequence as set forth in SEQ ID NO:25) or a vector
comprising a panD nucleic acid sequence (e.g., a panD nucleic acid sequence as set
forth in SEQ ID NO:27). In yet another embodiment, the invention features a
microorganism having a deregulated panBCD operon (e.g., SEQ ID NO:59).

Other preferred "recombinant" microorganisms of the present invention have a 
deregulated isoleucine-valine (ilv) pathway. The phrase "microorganism having a
deregulated isoleucine-valine (ilv) pathway" includes a microorganism having an
alteration or modification in at least one gene encoding an enzyme of the isoleucine-
valine (ilv) pathway or having an alteration or modification in an operon including more
than one gene encoding an enzyme of the isoleucine-valine (ilv) pathway. A preferred
"microorganism having a deregulated isoleucine-valine (ilv) pathway" has been
genetically engineered to overexpress a Bacillus (e.g., B. subtilis) ilv biosynthetic
enzyme (e.g., has been engineered to overexpress at least one of acetohydroxyacid
synthetase (the ilvBN gene products or the alsS gene product) (e.g., acetohydroxyacid
synthetase having subunits having the amino acid sequences of SEQ ID NO:32 and SEQ
ID NO:34 or encoded by nucleic acid molecules having the nucleotide sequence of SEQ
ID NO:31 and SEQ ID NO:33 or the nucleotide sequence of SEQ ID NO:58 from
nucleotides 1-2246 or acetohydroxyacid synthetase encoded by a nucleic acid molecule
having the nucleotide sequence of SEQ ID NO:86), acetohydroxyacid isomeroreductase
(the ilvC gene product) (e.g., acetohydroxyacid isomeroreductase having the amino acid
sequence of SEQ ID NO:36 or encoded by a nucleic acid molecule having the
nucleotide sequence of SEQ ID NO:35), dihydroxyacid dehydratase (the ilvD gene
product) (e.g., dihydroxyacid dehydratase having the amino acid sequence of SEQ ID
NO:38 or encoded by a nucleic acid molecule having the nucleotide sequence of SEQ
ID NO:37), and/or has been transformed with a vector comprising an ilvBNC nucleic
acid sequence (SEQ ID NO:58, coding regions from nucleotides 1-1725, 1722-2246 and
2263-3291) and/or an ilvD nucleic acid sequence (SEQ ID NO:37)).
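
As a purely arithmetic illustration, the lengths of the three coding regions recited above for SEQ ID NO:58, and the spacing between them, can be tallied as follows. This Python sketch assumes that the recited coordinates are 1-based and inclusive (an assumption, since the text does not state the numbering convention). The names CODING_REGIONS and summarize are hypothetical and introduced here for illustration only.

```python
# Coding-region coordinates recited in the text for SEQ ID NO:58.
# Assumption: coordinates are 1-based and inclusive.
CODING_REGIONS = [(1, 1725), (1722, 2246), (2263, 3291)]


def summarize(regions):
    """Return (length_nt, whole_codons, gap_to_next) for each region.

    gap_to_next counts the nucleotides lying between a region and the
    next one; a negative value means the two regions overlap.
    """
    rows = []
    for i, (start, end) in enumerate(regions):
        length = end - start + 1
        codons, remainder = divmod(length, 3)
        if remainder:
            raise ValueError(f"region {start}-{end} is not a multiple of 3")
        if i + 1 < len(regions):
            gap = regions[i + 1][0] - end - 1
        else:
            gap = None  # no following region
        rows.append((length, codons, gap))
    return rows


for (start, end), (length, codons, gap) in zip(CODING_REGIONS, summarize(CODING_REGIONS)):
    print(f"{start}-{end}: {length} nt, {codons} codons, gap to next = {gap}")
```

Under that assumption, each recited region spans a whole number of codons, the first two regions overlap by four nucleotides, and sixteen nucleotides separate the second region from the third.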

In another preferred embodiment, a recombinant microorganism is designed or 
engineered such that a mutant CoaA and/or CoaX biosynthetic enzyme is expressed and 
at least one pantothenate biosynthetic enzyme and/or at least one isoleucine-valine
biosynthetic enzyme is overexpressed or deregulated.

In another preferred embodiment, a microorganism of the present invention 
overexpresses or is mutated for a gene or biosynthetic enzyme (e.g., a CoA biosynthetic
enzyme, pantothenate biosynthetic enzyme or isoleucine-valine biosynthetic enzyme)
which is bacterially-derived. The term "bacterially-derived" or "derived-from", for
example bacteria, includes a gene product (e.g., ketopantoate hydroxymethyltransferase,
ketopantoate reductase, pantothenate synthetase, aspartate-α-decarboxylase,
acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase, dihydroxyacid
dehydratase or pantothenate kinase) which is encoded by a bacterial gene (e.g., panB,
panE, panC, panD, ilvBN (or alsS), ilvC, ilvD, or encoded by coaA or coaX).

Still other preferred recombinant microorganisms of the present invention are
mutant microorganisms. As used herein, the term "mutant microorganism" includes a
recombinant microorganism that has been genetically engineered to express a mutated
gene or protein that is normally or naturally expressed by the microorganism.
Preferably, a mutant microorganism expresses a mutated gene or protein such that the
microorganism exhibits an altered, modified or different phenotype (e.g., has been
engineered to express a mutated CoaA biosynthetic enzyme, for example, pantothenate
kinase). In one embodiment, a mutant microorganism is designed or engineered such
that it includes a mutant coaX gene, as defined herein. In another embodiment, a
recombinant microorganism is designed or engineered such that it includes a mutant
coaA gene, as defined herein. In another embodiment, a mutant microorganism is
designed or engineered such that a coaX gene has been deleted (i.e., the protein encoded
by the coaX gene is not produced). In another embodiment, a mutant microorganism is
designed or engineered such that a coaA gene has been deleted (i.e., the protein encoded
by the coaA gene is not produced). Preferably, a mutant microorganism has a mutant
coaX gene or a mutant coaA gene, or has been engineered to have a coaX gene and/or
coaA deleted, such that the mutant microorganism encodes a "reduced pantothenate
kinase activity". In the context of a whole microorganism, a "reduced pantothenate
kinase activity" can be determined by measuring or assaying for a decrease in an 
intermediate or product of the CoA biosynthetic pathway, for example, measuring or 
assaying for 4'-phosphopantothenate, 4'-phosphopantothenylcysteine, 4'-
phosphopantetheine, dephosphocoenzyme A, Coenzyme A, apo-acyl carrier protein
(apo-ACP) or holo-acyl carrier protein (ACP) in the microorganism (e.g., in a lysate
isolated or derived from the microorganism) or in the medium in which the 
microorganism is cultured (see e.g., Figure 16). Alternatively, a "reduced pantothenate 
kinase activity" can be determined by measuring or assaying for decreased growth of the 
microorganism. Alternatively, a "reduced pantothenate kinase activity" can be
determined by measuring or assaying for an increase in a panto-compound (e.g.,
pantothenate) in the microorganism or surrounding media, as panto-compounds lie 
upstream of the CoA biosynthetic pathway, the first step of which is catalyzed by 
pantothenate kinase. The invention also features recombinant microorganisms that, in 
addition to having reduced pantothenate kinase activity (e.g., expressing mutant coaA
and/or mutant coaX genes) have a deregulated pantothenate biosynthesis pathway and/or
a deregulated isoleucine-valine (ilv) biosynthetic pathway. 

In one embodiment, a recombinant microorganism of the present invention is a 
Gram positive organism (e.g., a microorganism which retains basic dye, for example, 
crystal violet, due to the presence of a Gram-positive wall surrounding the
microorganism). In a preferred embodiment, the recombinant microorganism is a
microorganism belonging to a genus selected from the group consisting of Bacillus,
Corynebacterium, Lactobacillus, Lactococci and Streptomyces. In a more preferred
embodiment, the recombinant microorganism is of the genus Bacillus. In another
preferred embodiment, the recombinant microorganism is selected from the group
consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, Bacillus firmus,
Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus circulans,
Bacillus coagulans, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus,
Bacillus thuringiensis, and other Group 1 Bacillus species, for example, as characterized
by 16S rRNA type (Priest (1993) in Bacillus subtilis and Other Gram-Positive Bacteria
eds. Sonenshein et al., ASM, Washington, D.C., p. 6). In another preferred
embodiment, the recombinant microorganism is Bacillus brevis or Bacillus
stearothermophilus. In another preferred embodiment, the recombinant microorganism
is selected from the group consisting of Bacillus licheniformis, Bacillus 
amyloliquefaciens, Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus. 

In another embodiment, the recombinant microorganism is a Gram negative 
(excludes basic dye) organism. In a preferred embodiment, the recombinant
microorganism is a microorganism belonging to a genus selected from the group
consisting of Salmonella, Escherichia, Klebsiella, Serratia, and Proteus. In a more
preferred embodiment, the recombinant microorganism is of the genus Escherichia. In
an even more preferred embodiment, the recombinant microorganism is Escherichia
coli. In another embodiment, the recombinant microorganism is Saccharomyces (e.g., S.
cerevisiae).

An important aspect of the present invention involves culturing the recombinant 
microorganisms described herein, such that a desired compound (e.g., a desired panto- 
compound) is produced. The term "culturing" includes maintaining and/or growing a 
living microorganism of the present invention (e.g., maintaining and/or growing a
culture or strain). In one embodiment, a microorganism of the invention is cultured in
liquid media. In another embodiment, a microorganism of the invention is cultured in
solid media or semi-solid media. In a preferred embodiment, a microorganism of the
invention is cultured in media (e.g., a sterile, liquid media) comprising nutrients
essential or beneficial to the maintenance and/or growth of the microorganism (e.g.,
carbon sources or carbon substrate, for example complex carbohydrates such as bean or
grain meal, starches, sugars, sugar alcohols, hydrocarbons, oils, fats, fatty acids, organic
acids and alcohols; nitrogen sources, for example, vegetable proteins, peptones, peptides
and amino acids derived from grains, beans and tubers, proteins, peptides and amino
acids derived from animal sources such as meat, milk and animal byproducts such as
peptones, meat extracts and casein hydrolysates; inorganic nitrogen sources such as
urea, ammonium sulfate, ammonium chloride, ammonium nitrate and ammonium
phosphate; phosphorus sources, for example, phosphoric acid, sodium and potassium
salts thereof; trace elements, for example, magnesium, iron, manganese, calcium,
copper, zinc, boron, molybdenum, and/or cobalt salts; as well as growth factors such as
amino acids, vitamins, growth promoters and the like).

Preferably, microorganisms of the present invention are cultured under
controlled pH. The term "controlled pH" includes any pH which results in production of 
the desired product (e.g. , a panto-compound). In one embodiment, microorganisms are 
cultured at a pH of about 7. In another embodiment, microorganisms are cultured at a 
5 pH of between 6.0 and 8.5. The desired pH may be maintained by any number of 
methods known to those skilled in the art. 

Also preferably, microorganisms of the present invention are cultured under
controlled aeration. The term "controlled aeration" includes sufficient aeration (e.g.,
oxygen) to result in production of the desired product (e.g., panto-compound). In one
embodiment, aeration is controlled by regulating oxygen levels in the culture, for
example, by regulating the amount of oxygen dissolved in culture media. Preferably,
aeration of the culture is controlled by agitating the culture. Agitation may be provided
by a propeller or similar mechanical agitation equipment, by revolving or shaking the
growth vessel (e.g., fermentor) or by various pumping equipment. Aeration may be
further controlled by the passage of sterile air or oxygen through the medium (e.g.,
through the fermentation mixture). Also preferably, microorganisms of the present
invention are cultured without excess foaming (e.g., via addition of antifoaming agents).

Moreover, microorganisms of the present invention can be cultured under
controlled temperatures. The term "controlled temperature" includes any temperature
which results in production of the desired product (e.g., a panto-compound). In one
embodiment, controlled temperatures include temperatures between 15°C and 95°C. In
another embodiment, controlled temperatures include temperatures between 15°C and
70°C. Preferred temperatures are between 20°C and 55°C, more preferably between
30°C and 45°C or between 30°C and 50°C.

Microorganisms can be cultured (e.g., maintained and/or grown) in liquid media
and preferably are cultured, either continuously or intermittently, by conventional
culturing methods such as standing culture, test tube culture, shaking culture (e.g., rotary
shaking culture, shake flask culture, etc.), aeration spinner culture, or fermentation. In a
preferred embodiment, the microorganisms are cultured in shake flasks. In a more
preferred embodiment, the microorganisms are cultured in a fermentor (e.g., a
fermentation process). Fermentation processes of the present invention include, but are
not limited to, batch, fed-batch and continuous methods of fermentation. The phrase
"batch process" or "batch fermentation" refers to a closed system in which the
composition of media, nutrients, supplemental additives and the like is set at the
beginning of the fermentation and not subject to alteration during the fermentation;
however, attempts may be made to control such factors as pH and oxygen concentration
to prevent excess media acidification and/or microorganism death. The phrase
"fed-batch process" or "fed-batch fermentation" refers to a batch fermentation with the
exception that one or more substrates or supplements are added (e.g., added in
increments or continuously) as the fermentation progresses. The phrase "continuous
process" or "continuous fermentation" refers to a system in which a defined
fermentation media is added continuously to a fermentor and an equal amount of used or
"conditioned" media is simultaneously removed, preferably for recovery of the desired
product (e.g., panto-compound). A variety of such processes have been developed and
are well-known in the art.

The phrase "culturing under conditions such that a desired compound (e.g., a
panto-compound, for example, pantothenate) is produced" includes maintaining and/or
growing microorganisms under conditions (e.g., temperature, pressure, pH, duration,
etc.) appropriate or sufficient to obtain production of the desired compound or to obtain
desired yields of the particular compound being produced. For example, culturing is
continued for a time sufficient to produce the desired amount of a panto-compound (e.g.,
pantothenate, pantoate or β-alanine). Preferably, culturing is continued for a time
sufficient to substantially reach maximal production of the panto-compound. In one
embodiment, culturing is continued for about 12 to 24 hours. In another embodiment,
culturing is continued for about 24 to 36 hours, 36 to 48 hours, 48 to 72 hours, 72 to 96
hours, 96 to 120 hours, 120 to 144 hours, or greater than 144 hours. In another
embodiment, culturing is continued for a time sufficient to reach production yields of
panto-compound, for example, cells are cultured such that at least about 15 to 20 g/L of
panto-compound are produced, at least about 20 to 25 g/L panto-compound are
produced, at least about 25 to 30 g/L panto-compound are produced, at least about 30 to
35 g/L panto-compound are produced, at least about 35 to 40 g/L panto-compound are
produced (e.g., at least about 37 g/L panto-compound) or at least about 40 to 50 g/L
panto-compound are produced. In yet another embodiment, microorganisms are
cultured under conditions such that a preferred yield of panto-compound, for example, a
yield within a range set forth above, is produced in about 24 hours, in about 36 hours, in
about 48 hours, in about 72 hours, or in about 96 hours.
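
As a rough illustration of how a titer and a culture time of the kind listed above relate to one another, the following Python sketch computes an average volumetric productivity in g/L per hour. The function name and the idea of reporting an average rate are assumptions made for illustration only; the specification states titers and durations, not rates, and the pairing of 37 g/L with 72 hours is used here simply as sample input.

```python
def average_productivity(titer_g_per_l: float, hours: float) -> float:
    """Return the average volumetric productivity (g/L/h).

    Illustrative helper, not part of the described methods: it simply
    divides a final titer by the culture time needed to reach it.
    """
    if hours <= 0:
        raise ValueError("culture time must be positive")
    if titer_g_per_l < 0:
        raise ValueError("titer cannot be negative")
    return titer_g_per_l / hours


# Sample input: a titer of about 37 g/L reached after about 72 hours.
rate = average_productivity(37.0, 72.0)
print(f"average productivity: {rate:.3f} g/L/h")
```

A single average of this kind hides the shape of the production curve, so it is best read as a coarse comparison figure between runs rather than as a process specification.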

The methodology of the present invention can further include a step of
recovering a desired compound (e.g., a panto-compound). The term "recovering" a
desired compound (e.g., a panto-compound) includes extracting, harvesting, isolating or
purifying the compound from culture media. Recovering the compound can be
performed according to any conventional isolation or purification methodology known
in the art including, but not limited to, treatment with a conventional resin (e.g., anion or
cation exchange resin, non-ionic adsorption resin, etc.), treatment with a conventional
adsorbent (e.g., activated charcoal, silicic acid, silica gel, cellulose, alumina, etc.),
alteration of pH, solvent extraction (e.g., with a conventional solvent such as an alcohol,
ethyl acetate, hexane and the like), dialysis, filtration, concentration, crystallization,
recrystallization, pH adjustment, lyophilization and the like. For example, a compound
(e.g., a panto-compound) can be recovered from culture media by first removing the
microorganisms from the culture. Media is then passed through or over a cation
exchange resin to remove unwanted cations and then through or over an anion exchange
resin to remove unwanted inorganic anions and organic acids having stronger acidities
than the panto-compound of interest (e.g., pantothenate). The resulting panto-compound
(e.g., pantothenate) can subsequently be converted to a pantothenate salt (e.g., calcium
pantothenate) as described herein.

Preferably, a desired compound of the present invention is "extracted",
"isolated" or "purified" such that the resulting preparation is substantially free of other
components (e.g., free of media components and/or fermentation byproducts). The
language "substantially free of other components" includes preparations of desired
compound in which the compound is separated (e.g., purified or partially purified) from
media components or fermentation byproducts of the culture from which it is produced.
In one embodiment, the preparation has greater than about 80% (by dry weight) of the
desired compound (e.g., less than about 20% of other media components or fermentation
byproducts), more preferably greater than about 90% of the desired compound (e.g., less
than about 10% of other media components or fermentation byproducts), still more
preferably greater than about 95% of the desired compound (e.g., less than about 5% of
other media components or fermentation byproducts), and most preferably greater than
about 98-99% desired compound (e.g., less than about 1-2% other media components or
fermentation byproducts). When the desired compound is a panto-compound that has
been derivatized to a salt (e.g., a pantothenate salt or pantoate salt), the panto-compound
is preferably further free (e.g., substantially free) of chemical contaminants associated
with the formation of the salt. When the desired compound is a panto-compound that
has been derivatized to an alcohol, the panto-compound is preferably further free (e.g.,
substantially free) of chemical contaminants associated with the formation of the
alcohol.
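
The dry-weight purity ranges described above can be expressed as a simple lookup. The Python sketch below is illustrative only: the function name, the tier labels, and the choice of 98% as the cut-off for the "about 98-99%" range are assumptions, and the word "about" in the text is not modelled.

```python
# (threshold in percent by dry weight, label), highest tier first.
# Labels and the 98% cut-off are illustrative assumptions.
TIERS = (
    (98.0, "most preferred"),
    (95.0, "still more preferred"),
    (90.0, "more preferred"),
    (80.0, "one embodiment"),
)


def purity_tier(desired_g: float, total_dry_g: float) -> str:
    """Classify a preparation by the dry-weight share of the desired compound."""
    if total_dry_g <= 0 or desired_g < 0 or desired_g > total_dry_g:
        raise ValueError("masses must satisfy 0 <= desired <= total, with total > 0")
    purity_pct = 100.0 * desired_g / total_dry_g
    for threshold, label in TIERS:
        if purity_pct > threshold:  # the text says "greater than"
            return label
    return "below the stated ranges"


# Hypothetical preparation: 96 g of desired compound in 100 g dry weight.
print(purity_tier(96.0, 100.0))
```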

In an alternative embodiment, the desired panto-compound is not purified from
the microorganism, for example, when the microorganism is biologically non-hazardous
(e.g., safe). For example, the entire culture (or culture supernatant) can be used as a
source of product (e.g., crude product). In one embodiment, the culture (or culture
supernatant) is used without modification. In another embodiment, the
culture (or culture supernatant) is concentrated. In yet another embodiment, the culture
(or culture supernatant) is dried or lyophilized.

II. Panto-Compound Production Methodologies Featuring Ketopantoate
Reductase-Overexpressing Microorganisms

One aspect of the invention features methods of producing panto-compounds
that involve culturing a ketopantoate reductase-overexpressing (KPAR-O)
microorganism under conditions such that the panto-compound is produced. The term
"ketopantoate reductase-overexpressing (KPAR-O) microorganism" includes a
microorganism which has been manipulated such that ketopantoate reductase is
overexpressed (e.g., a B. subtilis ketopantoate reductase protein having the amino acid
sequence of SEQ ID NO:30) and/or has been transformed with a vector comprising a
panE1 nucleic acid sequence (e.g., a B. subtilis panE1 nucleic acid sequence as set forth
in SEQ ID NO:29). In one embodiment, the panto-compound is pantothenate. In
another embodiment, the panto-compound is pantoate. In another embodiment, the
ketopantoate reductase is bacterial-derived. In another embodiment, the ketopantoate
reductase is derived from Bacillus (e.g., is derived from Bacillus subtilis). In yet
another embodiment, the KPAR-O microorganism is Gram positive. In yet another
embodiment, the KPAR-O microorganism is a microorganism belonging to a genus
selected from the group consisting of Bacillus, Corynebacterium, Lactobacillus,
Lactococci and Streptomyces. In a preferred embodiment, the KPAR-O microorganism
is of the genus Bacillus. In a more preferred embodiment, the KPAR-O microorganism
is selected from the group consisting of Bacillus licheniformis, Bacillus
amyloliquefaciens, Bacillus halodurans, Bacillus subtilis and Bacillus pumilus. In a
particularly preferred embodiment, the KPAR-O microorganism is Bacillus subtilis.

In still other embodiments, the KPAR-O microorganism further overexpresses at
least one pantothenate biosynthetic enzyme in addition to ketopantoate reductase. In an
exemplary embodiment, the KPAR-O microorganism further overexpresses at least one
of ketopantoate hydroxymethyltransferase, pantothenate synthetase and
aspartate-α-decarboxylase. Also featured are methods of producing panto-compounds, for example,
methods that involve culturing a KPAR-O microorganism, which further include the
step of recovering the panto-compound.

III. Methods of Producing Panto-Compounds Independent of Precursor Feed
Requirements

Depending on the biosynthetic enzyme or combination of biosynthetic enzymes
manipulated, it may be desirable or necessary to provide (e.g., feed) microorganisms of
the present invention at least one pantothenate biosynthetic precursor such that
pantothenate or other desired panto-compounds are produced. The term "pantothenate
biosynthetic precursor" or "precursor" includes an agent or compound which, when
provided to, brought into contact with, or included in the culture medium of a
microorganism, serves to enhance or increase pantothenate biosynthesis. In one
embodiment, the pantothenate biosynthetic precursor or precursor is aspartate. In
another embodiment, the pantothenate biosynthetic precursor or precursor is β-alanine.
The amount of aspartate or β-alanine added is preferably an amount that results in a
concentration in the culture medium sufficient to enhance productivity of the
microorganism (e.g., a concentration sufficient to enhance production of a
panto-compound, for example, β-alanine, ketopantoate, pantoate or pantothenate).
Pantothenate biosynthetic precursors of the present invention can be added in the form
of a concentrated solution or suspension (e.g., in a suitable solvent such as water or
buffer) or in the form of a solid (e.g., in the form of a powder). Moreover, pantothenate
biosynthetic precursors of the present invention can be added as a single aliquot,
continuously or intermittently over a given period of time.

In yet another embodiment, the pantothenate biosynthetic precursor is valine, see,
e.g., Example III. In yet another embodiment, the pantothenate biosynthetic precursor is
α-ketoisovalerate. Preferably, valine or α-ketoisovalerate is added in an amount that
results in a concentration in the medium sufficient for production of the desired product
(e.g., panto-compound) to occur. Pantothenate biosynthetic precursors are also referred
to herein as "supplemental pantothenate biosynthetic substrates".

Providing pantothenate biosynthetic precursors in the pantothenate biosynthetic
methodologies of the present invention can be associated with high costs, for example,
when the methodologies are used to produce high yields of panto-compounds.
Accordingly, preferred methodologies of the present invention feature microorganisms
having at least one biosynthetic enzyme or combination of biosynthetic enzymes (e.g., at
least one pantothenate biosynthetic enzyme and/or valine-isoleucine biosynthetic
enzyme) manipulated such that pantothenate or other desired panto-compounds are
produced in a manner independent of precursor feed. The phrase "a manner
independent of precursor feed", for example, when referring to a method for producing a
desired compound (e.g., a panto-compound), includes an approach to or a mode of
producing the desired compound that does not depend or rely on precursors being
provided (e.g., fed) to the microorganism being utilized to produce the desired
compound. For example, microorganisms featured in the methodologies of the present
invention can be used to produce panto-compounds in a manner requiring no feeding of
the precursors aspartate, β-alanine, valine and/or α-KIV.

Alternative preferred methodologies of the present invention feature
microorganisms having at least one biosynthetic enzyme or combination of biosynthetic
enzymes manipulated such that pantothenate or other desired panto-compounds are
produced in a manner substantially independent of precursor feed. The phrase "a
manner substantially independent of precursor feed" includes an approach to or a
method of producing the desired compound that depends or relies to a lesser extent on
precursors being provided (e.g., fed) to the microorganism being utilized. For example,
microorganisms featured in the methodologies of the present invention can be used to
produce panto-compounds in a manner requiring feeding of substantially reduced
amounts of the precursors aspartate, β-alanine, valine and/or α-KIV. In one
embodiment, the invention features methods of producing panto-compounds (e.g.,
pantothenate) in a manner that requires feeding of less than 5-10% of the amount of
precursor required by a control microorganism (e.g., a microorganism that is dependent,
for example is wholly dependent, on precursor feed to efficiently produce the desired
compound). In another embodiment, the invention features methods of producing
panto-compounds in a manner that requires feeding of less than 15-20% of the amount
of precursor required by a control microorganism. In another embodiment, the
invention features methods of producing panto-compounds in a manner that requires
feeding of less than 25-30%, 35-40%, 45-50% or 55-60% of the amount of precursor
required by a control microorganism. As described in Examples I-III herein, particular
microorganisms featured in the methodologies of the present invention require, for
example, 5 g/L of aspartate, β-alanine, valine or α-KIV (e.g., in test tube or in shake
flask cultures). Accordingly, in a preferred embodiment, the present invention features
methods of producing panto-compounds (e.g., pantothenate) in a manner requiring
feeding of less than 0.25 g/L, 0.5 g/L, 0.75 g/L, 1 g/L, 1.25 g/L, 1.5 g/L, 1.75 g/L, 2 g/L,
2.25 g/L, 2.5 g/L, 2.75 g/L or 3 g/L.
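
The g/L figures above line up with the percentage ranges when the 5 g/L example feed is taken as the control requirement. The short Python sketch below works through that arithmetic; treating 5 g/L as the control value is an assumption drawn from the example in the text, and the function and constant names are hypothetical.

```python
# Example precursor feed cited in the text for test tube or shake flask
# cultures; used here as the assumed control requirement.
CONTROL_FEED_G_PER_L = 5.0


def feed_fraction_percent(feed_g_per_l: float,
                          control_g_per_l: float = CONTROL_FEED_G_PER_L) -> float:
    """Return a precursor feed as a percentage of the control requirement."""
    if control_g_per_l <= 0:
        raise ValueError("control feed must be positive")
    if feed_g_per_l < 0:
        raise ValueError("feed cannot be negative")
    return 100.0 * feed_g_per_l / control_g_per_l


# A few of the feed levels listed in the text.
for feed in (0.25, 0.5, 1.0, 3.0):
    print(f"{feed:>4} g/L is {feed_fraction_percent(feed):.0f}% of the control feed")
```

On this reading, the lowest listed feed (0.25 g/L) corresponds to the 5% end of the first percentage range and the highest (3 g/L) to the 60% end of the last one.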

Preferred methods of producing desired compounds (e.g., panto-compounds) in a
manner independent of precursor feed or alternatively, in a manner substantially
independent of precursor feed, involve culturing microorganisms which have been
manipulated (e.g., designed or engineered, for example, genetically engineered) such
that expression of at least one pantothenate biosynthetic enzyme, and/or at least one
isoleucine-valine biosynthetic enzyme, is modified. For example, in one embodiment, a
microorganism is manipulated (e.g., designed or engineered) such that the production of
at least one pantothenate biosynthetic enzyme, and/or at least one isoleucine-valine
biosynthetic enzyme is deregulated. In a preferred embodiment, a microorganism is
manipulated (e.g., designed or engineered) such that it has a deregulated biosynthetic
pathway, for example, a deregulated pantothenate biosynthesis pathway and/or a
deregulated isoleucine-valine biosynthetic pathway, as defined herein. In another
preferred embodiment, a microorganism is manipulated (e.g., designed or engineered)
such that at least one pantothenate biosynthetic enzyme, and/or at least one
isoleucine-valine biosynthetic enzyme is overexpressed.

Preferred methods of producing desired compounds (e.g., panto-compounds) in a
manner independent of precursor feed or alternatively, in a manner substantially
independent of precursor feed, are as follows. In one embodiment, the invention
features a method of producing pantothenate in a manner independent of precursor feed
comprising culturing an aspartate-α-decarboxylase-overexpressing (AαD-O)
microorganism having a deregulated isoleucine-valine (ilv) pathway under conditions
such that pantothenate is produced. In another embodiment, the invention features a
method of producing pantothenate in a manner independent of precursor feed
comprising culturing an aspartate-α-decarboxylase-overexpressing (AαD-O)
microorganism having a deregulated pantothenate (pan) pathway and a deregulated
isoleucine-valine (ilv) pathway, under conditions such that pantothenate is produced. In
another embodiment, the invention features a method of producing pantothenate in a
manner independent of aspartate or β-alanine feed comprising culturing an
aspartate-α-decarboxylase-overexpressing (AαD-O) microorganism under conditions such that
pantothenate is produced. In yet another embodiment, the invention features a method
of producing pantothenate in a manner independent of valine or α-ketoisovalerate feed
comprising culturing a microorganism having a deregulated isoleucine-valine (ilv)
biosynthetic pathway under conditions such that pantothenate is produced.

The term "aspartate-α-decarboxylase-overexpressing (AαD-O) microorganism"
includes a microorganism which has been manipulated such that
aspartate-α-decarboxylase is overexpressed. A preferred "aspartate-α-decarboxylase-overexpressing
(AαD-O) microorganism" has been transformed with a vector comprising a B. subtilis
panD nucleic acid sequence (e.g., a panD nucleic acid sequence that encodes an
aspartate-α-decarboxylase protein having the amino acid sequence of SEQ ID NO:28,
for example, a panD nucleic acid sequence as set forth in SEQ ID NO:27).

The phrase "microorganism having a deregulated isoleucine-valine (ilv)
pathway" includes a microorganism having an alteration or modification in at least one
gene encoding an enzyme of the isoleucine-valine (ilv) pathway or having an alteration
or modification in an operon including more than one gene encoding an enzyme of the
isoleucine-valine (ilv) pathway. A preferred "microorganism having a deregulated
isoleucine-valine (ilv) pathway" overexpresses acetohydroxyacid synthetase (e.g.,
acetohydroxyacid synthetase having subunits having the amino acid sequences of SEQ
ID NO:32 and SEQ ID NO:34 or acetohydroxyacid synthetase having the amino acid
sequence of SEQ ID NO:87), acetohydroxyacid isomeroreductase (having the amino
acid sequence of SEQ ID NO:36), or dihydroxyacid dehydratase (having the amino acid
sequence of SEQ ID NO:38) and/or has been transformed with a vector comprising ilvB,
ilvN, ilvC, ilvBN, ilvBNC or alsS nucleic acid sequences (SEQ ID NO:31, SEQ ID
NO:33, SEQ ID NO:35, nucleotides 1-2246 of SEQ ID NO:58, SEQ ID NO:58 having
coding regions from nucleotides 1-1725, 1722-2246 and 2263-3291, or SEQ ID NO:86,
respectively) and/or an ilvD nucleic acid sequence (SEQ ID NO:37).

IV. High Yield Production Methodologies 

A particularly preferred embodiment of the present invention is a high yield
production method for producing pantothenate comprising culturing a manipulated
microorganism under conditions such that pantothenate is produced at a significantly
high yield. The phrase "high yield production method", for example, a high yield
production method for producing a desired compound (e.g., for producing a
panto-compound) includes a method that results in production of the desired compound at a
level which is elevated or above what is usual for comparable production methods.
Preferably, a high yield production method results in production of the desired
compound at a significantly high yield. The phrase "significantly high yield" includes a
level of production or yield which is sufficiently elevated or above what is usual for
comparable production methods, for example, which is elevated to a level sufficient for
commercial production of the desired product (e.g., production of the product at a
commercially feasible cost). In one embodiment, the invention features a high yield
production method of producing pantothenate that includes culturing a manipulated
microorganism under conditions such that pantothenate is produced at a level greater
than 2 g/L. In another embodiment, the invention features a high yield production
method of producing pantothenate that includes culturing a manipulated microorganism
under conditions such that pantothenate is produced at a level greater than 10 g/L. In
another embodiment, the invention features a high yield production method of
producing pantothenate that includes culturing a manipulated microorganism under
conditions such that pantothenate is produced at a level greater than 20 g/L. In yet
another embodiment, the invention features a high yield production method of
producing pantothenate that includes culturing a manipulated microorganism under
conditions such that pantothenate is produced at a level greater than 30 g/L. In yet
another embodiment, the invention features a high yield production method of
producing pantothenate that includes culturing a manipulated microorganism under
conditions such that pantothenate is produced at a level greater than 40 g/L.

The invention further features a high yield production method for producing a
desired compound (e.g., for producing a panto-compound) that involves culturing a
manipulated microorganism under conditions such that a sufficiently elevated level of
compound is produced within a commercially desirable period of time. In an
exemplary embodiment, the invention features a high yield production method of
producing pantothenate that includes culturing a manipulated microorganism under
conditions such that pantothenate is produced at a level greater than 15-20 g/L in 36
hours. In another embodiment, the invention features a high yield production method of
producing pantothenate that includes culturing a manipulated microorganism under
conditions such that pantothenate is produced at a level greater than 25-30 g/L in 48
hours. In another embodiment, the invention features a high yield production method of
producing pantothenate that includes culturing a manipulated microorganism under
conditions such that pantothenate is produced at a level greater than 35-40 g/L in 72
hours, for example, greater than 37 g/L in 72 hours. In another embodiment, the
invention features a high yield production method of producing pantothenate that
includes culturing a manipulated microorganism under conditions such that pantothenate
is produced at a level greater than 30-40 g/L in 60 hours, for example, greater than 30, 35
or 40 g/L in 60 hours. Values and ranges included and/or intermediate within the ranges
set forth herein are also intended to be within the scope of the present invention. For
example, pantothenate production at levels of at least 31, 32, 33, 34, 35, 36, 37, 38 and
39 g/L in 60 hours are intended to be included within the range of 30-40 g/L in 60 hours.
In another example, ranges of 30-35 g/L or 35-40 g/L are intended to be included within
the range of 30-40 g/L in 60 hours. Moreover, the skilled artisan will appreciate that
culturing a manipulated microorganism to achieve a production level of, for example,
"30-40 g/L in 60 hours" includes culturing the microorganism for additional time
periods (e.g., time periods longer than 60 hours), optionally resulting in even higher
yields of pantothenate being produced.
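
One way to read the time-bound production levels above is as a set of checkpoints against which a measured time course can be compared. The Python sketch below does this under stated assumptions: the lower bound of each stated range is used as the threshold, the function and variable names are hypothetical, and the time course shown is invented sample data rather than a result from the specification.

```python
from typing import Iterable, List, Tuple

# (time limit in hours, minimum titer in g/L). Assumption: the lower bound
# of each range in the text (15-20, 25-30, 30-40, 35-40 g/L) is the threshold.
BENCHMARKS: Tuple[Tuple[float, float], ...] = (
    (36.0, 15.0),
    (48.0, 25.0),
    (60.0, 30.0),
    (72.0, 35.0),
)


def benchmarks_met(time_course: Iterable[Tuple[float, float]]) -> List[Tuple[float, float]]:
    """Return the benchmarks reached by a (hours, titer_g_per_l) time course.

    A benchmark counts as met if any sample taken at or before its time
    limit shows a titer greater than its threshold.
    """
    samples = list(time_course)
    met = []
    for limit_h, min_titer in BENCHMARKS:
        if any(h <= limit_h and titer > min_titer for h, titer in samples):
            met.append((limit_h, min_titer))
    return met


# Invented sample data for illustration only.
hypothetical_run = [(24.0, 9.0), (36.0, 17.5), (48.0, 24.0), (60.0, 31.0), (72.0, 37.5)]
print(benchmarks_met(hypothetical_run))
```

Because the text notes that culturing may continue past the stated time, a run that misses a checkpoint is not thereby excluded from reaching a higher level later; the sketch only reports which checkpoints were reached on time.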

V. Panto-Compound Production Methodologies Featuring Pantothenate 
Kinase Mutant Microorganisms 

The present invention relates to methods of producing pantothenate using
microorganisms engineered to produce high yields of pantothenate as well as other
panto-compounds. Cells overproducing pantothenate result in high intracellular
pantothenate levels that could overcome the feedback inhibition of pantothenate kinase
by CoA, leading to overproduction of CoA. Besides consuming pantothenate, increased
synthesis of CoA may cause increased feedback inhibition of the PanB, PanD, PanE or
PanC reaction, thereby limiting pantothenate production. Accordingly, a reduction in
pantothenate kinase activity may lead to a decrease in CoA levels with resulting
increases in PanB, PanD, PanE or PanC activity and pantothenate production.

Thus, certain methodologies of the present invention are based, at least in part,
on the identification and characterization of the B. subtilis coaA gene and the
demonstration that the gene is neither essential for B. subtilis growth (i.e., deletion of the
coaA gene from the chromosome of B. subtilis is not a lethal event) nor for pantothenate
kinase activity in B. subtilis. A second pantothenate kinase-encoding gene has been
identified and characterized in B. subtilis, and is termed "coaX". This gene
complements an E. coli mutant that contains a temperature sensitive pantothenate kinase
and is not related by homology to any previously known pantothenate kinase gene.
In one aspect, the methodologies of the invention feature recombinant
microorganisms that include the coaX gene or that include a mutant coaX gene, having
reduced pantothenate kinase activity. In one embodiment, the methodologies feature
such recombinant microorganisms further having a deregulated pantothenate
biosynthetic pathway. In another embodiment, the methodologies feature such
recombinant microorganisms further having a deregulated isoleucine-valine (ilv)
pathway. In a preferred embodiment, the microorganisms belong to the genus Bacillus
(e.g., B. subtilis).

The methodologies of the invention also feature recombinant microorganisms
(e.g., microorganisms belonging to the genus Bacillus, for example, B. subtilis) that
include the coaA gene or that include a mutant coaA gene, optionally including a coaX
gene or mutant thereof, having reduced pantothenate kinase activity. In one
embodiment, the methodologies feature such recombinant microorganisms further
having a deregulated pantothenate biosynthetic pathway or having a deregulated
isoleucine-valine (ilv) pathway. Also featured are vectors that include isolated coaX or
coaA genes as well as mutant coaX and/or coaA genes. Isolated nucleic acid molecules
that include isolated coaX genes or mutant coaX genes are featured in addition to
isolated CoaX proteins and mutant CoaX proteins.

The above-described nucleic acid molecules (e.g., genes), proteins, vectors, and
recombinant microorganisms (e.g., mutant microorganisms), are particularly suited for
use in methods of producing panto-compounds and/or methods of enhancing
panto-compound production. In one embodiment, the invention features a method for
producing a panto-compound (e.g., pantothenate) that includes culturing a pantothenate
kinase mutant (e.g., a recombinant microorganism that misexpresses, e.g., is mutated
for, pantothenate kinase, as defined herein) under conditions such that panto-compound
is produced. In another embodiment, the invention features a method for enhancing
production of a panto-compound (e.g., pantothenate) that includes culturing a
pantothenate kinase mutant (e.g., a recombinant microorganism that misexpresses, e.g.,
is mutated for, pantothenate kinase, as defined herein) under conditions such that
production of the panto-compound is enhanced. As used herein, the term "enhancing"
(for example, in the context of the phrase "enhancing production") includes increasing
the level or rate of production of panto-compound (e.g., pantothenate) as compared to
the level or rate of production in a non-mutant microorganism (e.g., a microorganism
having a normal pantothenate kinase gene(s) and/or having normal pantothenate
production rates and/or levels).

Preferably, the level of panto-compound produced in methodologies featuring 
the pantothenate kinase mutants of the present invention is increased by at least 5% as 
compared to the level produced by a non-mutant (e.g., a recombinant microorganism 
expressing non-mutated pantothenate kinase). More preferably, the level of panto-
compound is increased 10% as compared to methodologies featuring non-mutants.
Even more preferably, panto-compound levels (e.g., pantothenate levels) are increased 
20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or are increased 2-fold, 5-fold, 10-fold, 50-
fold, 100-fold or more as compared to methodologies featuring non-mutants.
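
As a rough illustration of the thresholds recited above, the comparison between a pantothenate kinase mutant and a non-mutant can be expressed as a percent increase or as a fold change. The sketch below is illustrative only: the function names and the example levels are hypothetical and are not taken from the Examples, and "N-fold" is assumed here to mean the ratio of the mutant level to the non-mutant level.

```python
def percent_increase(mutant_level, reference_level):
    """Percent increase of the mutant level over the non-mutant (reference) level."""
    if reference_level <= 0:
        raise ValueError("reference_level must be positive")
    return (mutant_level - reference_level) / reference_level * 100.0


def fold_change(mutant_level, reference_level):
    """Ratio of mutant level to reference level (assumed meaning of 'N-fold')."""
    if reference_level <= 0:
        raise ValueError("reference_level must be positive")
    return mutant_level / reference_level


def meets_threshold(mutant_level, reference_level, min_percent=5.0):
    """True if the increase is at least min_percent (5% is the lowest level recited)."""
    return percent_increase(mutant_level, reference_level) >= min_percent


if __name__ == "__main__":
    # Hypothetical levels in arbitrary units, for illustration only.
    print(percent_increase(3.0, 2.0))
    print(fold_change(10.0, 2.0))
    print(meets_threshold(3.0, 2.0))
```

Both measures use the same reference, so the same pair of measurements can be checked against either the percentage thresholds or the fold thresholds.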

VI. Additional Mutations Resulting in Enhanced Panto-Compound 
Production 

The methodologies of the present invention further can include, for example in 
addition to overexpressing or deregulating a pantothenate biosynthetic enzyme and/or an
isoleucine-valine biosynthetic enzyme, or in addition to mutating a pantothenate-kinase
encoding gene, deleting or mutating an enzyme that catalyzes the conversion of key 
pantothenate biosynthesis substrates or precursors to unwanted or undesirable products. 
For example, mutating the ilvE gene (Kuramitsu et al. (1985) J. Biochem. 97:993-999)
or a homologue thereof (SEQ ID NO:62 or SEQ ID NO:64), thereby limiting the
conversion of α-ketoisovalerate to valine, in addition to mutating a pantothenate kinase
encoding enzyme, is predicted to result in even further enhanced or increased production
of panto-compound. Alternatively, mutating the ansB gene (Sun and Setlow (1991) J.
Bacteriol. 173:3831-3845) or a homologue thereof (SEQ ID NO:66), thereby limiting
the degradation of aspartate, in addition to mutating a pantothenate kinase encoding
enzyme, is predicted to result in even further enhanced or increased production of panto-
compound. Alternatively, mutating the alsD gene (Renna et al. (1993) J. Bacteriol.
175:3863-3875) or a homologue thereof (SEQ ID NO:68), thereby limiting the
conversion of acetolactate to acetoin, in addition to mutating a pantothenate kinase 
encoding enzyme, is predicted to result in even further enhanced or increased production
of panto-compound. Alternatively, mutating the avtA gene encoding alanine-valine
transaminase or a homologue thereof, thereby limiting the conversion of α-
ketoisovalerate to valine, in addition to mutating a pantothenate kinase encoding
enzyme, is predicted to result in even further enhanced or increased production of panto-
compound. Mutating the avtA gene can include mutating, for example, an avtA gene
having the nucleotide sequence of SEQ ID NO:70 (e.g., the E. coli avtA gene), or a 
structural homologue thereof (e.g., a homologue encoding a protein having 30-40%, 40-
50%, 50-60%, 60-70%, 70-80%, 80-90%, 90-95% or more identity with the amino acid
sequence of SEQ ID NO:71) or a functional homologue (e.g., a gene encoding a
structurally unrelated protein having alanine-valine transaminase activity). Such
mutations can be accomplished using the methodologies as exemplified in the Examples 
(e.g., Examples XIII, XV, XVI and XVII). 

Accordingly, in one embodiment, the invention features a method of producing a
panto-compound which includes culturing a microorganism having a mutant 
pantothenate kinase-encoding gene and which further has a deletion or mutation in an 
avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. In another embodiment, the 
invention features a method of producing a panto-compound which includes culturing a
microorganism having a mutant pantothenate-kinase encoding gene and a deregulated
pantothenate biosynthetic pathway enzyme and which further has a deletion or mutation 
in an avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. In another embodiment, 
the invention features a method of producing a panto-compound which includes 
culturing a microorganism having a mutant pantothenate-kinase encoding gene and a
deregulated isoleucine-valine biosynthetic pathway enzyme and which further has a
deletion or mutation in an avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. 

Mutating the alsD gene can be particularly useful when accomplished in 
conjunction with overexpression or deregulation of the alsS gene, for example, to 
prevent carbon (e.g., acetolactate) from being drawn away from the precursor pool
utilized for α-KIV production. Accordingly, to maximize the contribution of the als
locus to panto-compound production, it is desirable to disrupt the alsD gene in addition 
to overexpressing the alsS gene. To disrupt the alsD gene, appropriate fragments of the 
als operon, flanking the alsD gene, are amplified by PCR and cloned to provide 
homology for creating the disruptions. A drug resistance gene, such as the cat gene, is
cloned between the flanking DNA fragments in place of the alsD gene, and the
linearized DNA is transformed into a pantothenate production strain such as PA824, 
selecting for drug-resistance. To overexpress alsS, the alsS coding sequence (e.g., an 
alsS coding sequence that has been engineered by PCR for expression) is cloned into an 
expression vector. Vectors which express alsS (or alternatively, vectors which express
alsS plus ilvC) are then introduced into panto-compound production strains (e.g., the
pantothenate producing strain PA824). 
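
The layout of the linear alsD disruption fragment described above (upstream flank, drug resistance gene, downstream flank) can be sketched in silico as a simple concatenation. This is only a schematic under stated assumptions: the function name is hypothetical, the sequences used in the example are short placeholders rather than real als or cat sequences, and the sketch says nothing about primer design, cloning steps, or orientation of the marker.

```python
VALID_BASES = set("ACGT")


def build_disruption_cassette(upstream_flank, marker, downstream_flank):
    """Return flank + marker + flank as one uppercase DNA string.

    The marker (e.g., a cat gene) takes the place of the target gene,
    and the two flanks provide homology for recombination.
    """
    parts = {
        "upstream_flank": upstream_flank,
        "marker": marker,
        "downstream_flank": downstream_flank,
    }
    for name, seq in parts.items():
        if not seq:
            raise ValueError(f"{name} must not be empty")
        if set(seq.upper()) - VALID_BASES:
            raise ValueError(f"{name} contains characters other than A, C, G, T")
    return (upstream_flank + marker + downstream_flank).upper()


if __name__ == "__main__":
    # Placeholder sequences only; not actual als operon or cat gene sequences.
    cassette = build_disruption_cassette("atggca", "CCCGGG", "ttataa")
    print(cassette, len(cassette))
```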

The methodologies of the present invention further can include, for example in
addition to overexpressing or deregulating a pantothenate biosynthetic enzyme and/or an
isoleucine-valine biosynthetic enzyme, or in addition to mutating a pantothenate-kinase
encoding gene, deleting or mutating an enzyme that catalyzes the conversion of desired
panto-compounds to unwanted or undesirable downstream products.

VII. Isolated Nucleic Acid Molecules and Genes 

Another aspect of the present invention features isolated nucleic acid molecules 
that encode Bacillus proteins (e.g., B. subtilis proteins), for example, Bacillus
pantothenate biosynthetic enzymes (e.g., B. subtilis pantothenate biosynthetic enzymes)
or Bacillus valine-isoleucine biosynthetic enzymes (e.g., B. subtilis valine-isoleucine
biosynthetic enzymes). Also featured are isolated coaX and/or coaA nucleic acid
molecules (e.g., isolated coaX and/or coaA genes) as well as isolated nucleic acid
molecules that include such coaX and/or coaA nucleic acid molecules or genes. 

The term "nucleic acid molecule" includes DNA molecules (e.g., linear, circular,
cDNA or chromosomal DNA) and RNA molecules (e.g., tRNA, rRNA, mRNA) and
analogs of the DNA or RNA generated using nucleotide analogs. The nucleic acid 
molecule can be single-stranded or double-stranded, but preferably is double-stranded 
DNA. The term "isolated" nucleic acid molecule includes a nucleic acid molecule
which is free of sequences which naturally flank the nucleic acid molecule (i.e.,
sequences located at the 5' and 3' ends of the nucleic acid molecule) in the chromosomal 
DNA of the organism from which the nucleic acid is derived. In various embodiments, 
an isolated nucleic acid molecule can contain less than about 10 kb, 5 kb, 4 kb, 3 kb, 2 kb,
1 kb, 0.5 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences which naturally flank
the nucleic acid molecule in chromosomal DNA of the microorganism from which the
nucleic acid molecule is derived. Moreover, an "isolated" nucleic acid molecule, such 
as a cDNA molecule, can be substantially free of other cellular materials when produced 
by recombinant techniques, or substantially free of chemical precursors or other 
chemicals when chemically synthesized. 

The term "gene", as used herein, includes a nucleic acid molecule (e.g., a DNA
molecule or segment thereof), for example, a protein or RNA-encoding nucleic acid
molecule, that in an organism, is separated from another gene or other genes, by
intergenic DNA (i.e., intervening or spacer DNA which naturally flanks the gene and/or
separates genes in the chromosomal DNA of the organism). A gene may direct
synthesis of an enzyme or other protein molecule (e.g., may comprise coding sequences,
for example, a contiguous open reading frame (ORF) which encodes a protein) or may
itself be functional in the organism. A gene in an organism may be clustered in an
operon, as defined herein, said operon being separated from other genes and/or operons
by the intergenic DNA. Individual genes contained within an operon may overlap 
without intergenic DNA between said individual genes. An "isolated gene", as used 
herein, includes a gene which is essentially free of sequences which naturally flank the 
gene in the chromosomal DNA of the organism from which the gene is derived (i.e., is
free of adjacent coding sequences which encode a second or distinct protein or RNA 
molecule, adjacent structural sequences or the like) and optionally includes 5' and 3' 
regulatory sequences, for example promoter sequences and/or terminator sequences. In 
one embodiment, an isolated gene includes predominantly coding sequences for a
protein (e.g., sequences which encode Bacillus proteins). In another embodiment, an
isolated gene includes coding sequences for a protein (e.g., for a Bacillus protein) and 
adjacent 5' and/or 3' regulatory sequences from the chromosomal DNA of the organism 
from which the gene is derived (e.g., adjacent 5' and/or 3' Bacillus regulatory 
sequences). Preferably, an isolated gene contains less than about 10 kb, 5 kb, 2 kb, 1 kb,
0.5 kb, 0.2 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences which naturally
flank the gene in the chromosomal DNA of the organism from which the gene is 
derived. 

In one aspect, the present invention features isolated panB nucleic acid 
sequences or genes, isolated panC nucleic acid sequences or genes, isolated panD
nucleic acid sequences or genes, isolated panE nucleic acid sequences or genes, isolated
ilvB, ilvN, ilvBN nucleic acid sequences or genes, isolated alsS nucleic acid sequences or 
genes, isolated ilvC nucleic acid sequences or genes and/or isolated ilvD nucleic acid 
sequences or genes. 

In a preferred embodiment, the nucleic acid or gene is derived from Bacillus
(e.g., is Bacillus-derived). The term "derived from Bacillus" or "Bacillus-derived"
includes a nucleic acid or gene which is naturally found in microorganisms of the genus 
Bacillus. Preferably, the nucleic acid or gene is derived from a microorganism selected 
from the group consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, 
Bacillus firmus, Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus,
Bacillus circulans, Bacillus coagulans, Bacillus licheniformis, Bacillus megaterium,
Bacillus pumilus, Bacillus thuringiensis, and other Group 1 Bacillus species, for 
example, as characterized by 16S rRNA type (Priest, supra). In another preferred 
embodiment, the nucleic acid or gene is derived from Bacillus brevis or Bacillus 
stearothermophilus. In another preferred embodiment, the nucleic acid molecules
and/or genes of the present invention are derived from a microorganism selected from
the group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens, Bacillus
halodurans, Bacillus subtilis, and Bacillus pumilus. In a particularly preferred
embodiment, the nucleic acid or gene is derived from Bacillus subtilis (e.g., is Bacillus
subtilis-derived). The term "derived from Bacillus subtilis" or "Bacillus subtilis- 
derived" includes a nucleic acid or gene which is naturally found in Bacillus subtilis. In 
yet another preferred embodiment, the nucleic acid or gene is a Bacillus gene 
homologue (e.g., is derived from a species distinct from Bacillus but having significant
homology to a Bacillus gene of the present invention, for example, a Bacillus pan gene 
or Bacillus ilv gene). 

Included within the scope of the present invention are bacterial-derived nucleic 
acid molecules or genes and/or Bacillus-derived nucleic acid molecules or genes (e.g.,
B. subtilis-derived nucleic acid molecules or genes), for example, the genes identified by
the present inventors, for example, Bacillus or B. subtilis coaX genes, coaA genes, pan 
genes and/or ilv genes. Further included within the scope of the present invention are 
bacterial-derived nucleic acid molecules or genes and/or Bacillus-derived nucleic acid 
molecules or genes (e.g., B. subtilis-derived nucleic acid molecules or genes) (e.g., B.
subtilis nucleic acid molecules or genes) which differ from naturally-occurring bacterial
and/or Bacillus nucleic acid molecules or genes (e.g., B. subtilis nucleic acid molecules 
or genes), for example, nucleic acid molecules or genes which have nucleic acids that 
are substituted, inserted or deleted, but which encode proteins substantially similar to the 
naturally-occurring gene products of the present invention. In one embodiment, an
isolated nucleic acid molecule comprises at least one of the nucleotide sequences set
forth as SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID
NO:31, SEQ ID NO:33, SEQ ID NO:86, SEQ ID NO:35 or SEQ ID NO:37. In another
preferred embodiment, an isolated nucleic acid molecule comprises at least two, three or 
four of the nucleotide sequences set forth as SEQ ID NO:23, SEQ ID NO:25, SEQ ID
NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:88, SEQ ID
NO:35 or SEQ ID NO:37. For example, a preferred isolated nucleic acid molecule of 
the present invention can include the nucleotide sequences of SEQ ID NO:23, SEQ ID 
NO:25 and SEQ ID NO:27, preferably linked such that the proteins encoded by the 
nucleotide sequences of SEQ ID NO:23, SEQ ID NO:25 and SEQ ID NO:27 are each
produced when the isolated nucleic acid molecule is expressed in a microorganism (e.g.,
SEQ ID NO:59). In another example, a preferred isolated nucleic acid molecule of the
present invention can include the nucleotide sequences of SEQ ID NO:31 and SEQ ID 
NO:33, preferably linked such that the proteins encoded by the nucleotide sequences of 
SEQ ID NO:31 and SEQ ID NO:33 are each produced when the isolated nucleic acid
molecule is expressed in a microorganism (e.g., nucleotides 1-2246 of SEQ ID NO:58).
In another example, a preferred isolated nucleic acid molecule of the present invention 
can include the nucleotide sequence of SEQ ID NO:86. In another example, a preferred
isolated nucleic acid molecule of the present invention can include the nucleotide
sequences of SEQ ID NO:31, SEQ ID NO:33 and SEQ ID NO:35, preferably linked
such that the proteins encoded by the nucleotide sequences of SEQ ID NO:31, SEQ ID
NO:33 and SEQ ID NO:35 are each produced when the isolated nucleic acid molecule is
expressed in a microorganism (e.g., SEQ ID NO:58).

In another embodiment, an isolated nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least about 60-65%, preferably at 
least about 70-75%, more preferably at least about 80-85%, and even more preferably at
least about 90-95% or more identical to a nucleotide sequence set forth as SEQ ID
NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID
NO:33, SEQ ID NO:88, SEQ ID NO:35 or SEQ ID NO:37. In another embodiment, an 
isolated nucleic acid molecule hybridizes under stringent conditions to a nucleic acid 
molecule having a nucleotide sequence set forth as SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:88, SEQ
ID NO:35 or SEQ ID NO:37. Such stringent conditions are known to those skilled in
the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent (e.g., high
stringency) hybridization conditions is hybridization in 6X sodium chloride/sodium
citrate (SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS
at 50-65°C. Preferably, an isolated nucleic acid molecule of the invention that
hybridizes under stringent conditions to the sequence of SEQ ID NO:23, SEQ ID 
NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID
NO:88, SEQ ID NO:35 or SEQ ID NO:37 corresponds to a naturally-occurring nucleic 
acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to
an RNA or DNA molecule having a nucleotide sequence that occurs in nature.
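
For illustration, the percent identity figures recited above can be computed from a pairwise alignment. The sketch below is a minimal example under stated assumptions: the two sequences are assumed to be already aligned (equal length, with "-" marking gaps), identity is taken as matching positions divided by aligned columns, and the function name is hypothetical. This section does not specify a particular alignment program or identity convention, so other conventions may give different values.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences have a gap are ignored; a gap opposite
    a residue counts as a compared, non-matching column.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy nucleotide alignment, not taken from any SEQ ID NO.
    print(percent_identity("ACGTACGT", "ACGTTCGA"))
```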

A nucleic acid molecule of the present invention (e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:88, SEQ ID NO:35 or
SEQ ID NO:37) can be isolated using standard molecular biology techniques and the
sequence information provided herein. For example, nucleic acid molecules can be
isolated using standard hybridization and cloning techniques (e.g., as described in 
Sambrook, J., Fritsch, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual.
2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY, 1989) or can be isolated by the polymerase chain reaction using
synthetic oligonucleotide primers designed based upon the sequence of SEQ ID NO:23,
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ
ID NO:88, SEQ ID NO:35 or SEQ ID NO:37. A nucleic acid of the invention can be
amplified using cDNA, mRNA or alternatively, genomic DNA, as a template and
appropriate oligonucleotide primers according to standard PCR amplification 
techniques. In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
sequence shown in SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29,
SEQ ID NO:33, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:88, SEQ ID NO:35.
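
The complement of a nucleotide sequence, as referred to above, is conventionally written as the reverse complement so that it also reads 5' to 3'. The following sketch shows one way to compute it for a DNA string; the function name and the toy sequence are hypothetical, and only the unambiguous bases plus N are handled.

```python
# Translation table for DNA bases; N is left as N.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    allowed = set("ACGTNacgtn")
    if set(seq) - allowed:
        raise ValueError("sequence contains characters other than A, C, G, T, N")
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Toy sequence, not taken from any SEQ ID NO.
    print(reverse_complement("ATGC"))
```

Applying the function twice returns the original sequence, which is a quick sanity check when preparing primer or probe sequences by hand.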

Additional panC nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:25, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:26 (e.g., encode a polypeptide having
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide
having the amino acid sequence as set forth in SEQ ID NO:26 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:25 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the
amino acid sequence of SEQ ID NO:26, or are complementary to a panC nucleotide
sequence as set forth herein. 

Additional panD nucleic acid sequences include those that comprise the
nucleotide sequence of SEQ ID NO:27, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:28 (e.g., encode a polypeptide having
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide
having the amino acid sequence as set forth in SEQ ID NO:28 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:27 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the
amino acid sequence of SEQ ID NO:28, or are complementary to a panD nucleotide
sequence as set forth herein. 

Additional panE nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:29, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:30 (e.g., encode a polypeptide having
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide
having the amino acid sequence as set forth in SEQ ID NO:30 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:29 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the
amino acid sequence of SEQ ID NO:30, or are complementary to a panE nucleotide
sequence as set forth herein.

Additional ilvB nucleic acid sequences are those that comprise the nucleotide
sequence of SEQ ID NO:31, encode a homologue of the polypeptide having the amino
acid sequence set forth in SEQ ID NO:32 (e.g., encode a polypeptide having at least
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide having
the amino acid sequence as set forth in SEQ ID NO:32 and a substantially identical
activity as said polypeptide), hybridize under stringent conditions to all or a portion of a
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:31 or to all or a
portion of a nucleic acid molecule that encodes a polypeptide having the amino acid
sequence of SEQ ID NO:32, or are complementary to an ilvB nucleotide sequence as set
forth herein.

Additional ilvN nucleic acid sequences are those that comprise the nucleotide 
sequence of SEQ ID NO:33, encode a homologue of the polypeptide having the amino 
acid sequence set forth in SEQ ID NO:34 (e.g., encode a polypeptide having at least 
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide having
the amino acid sequence as set forth in SEQ ID NO:34 and a substantially identical
activity as said polypeptide), hybridize under stringent conditions to all or a portion of a 
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:33 or to all or a 
portion of a nucleic acid molecule that encodes a polypeptide having the amino acid 
sequence of SEQ ID NO:34, or are complementary to an ilvN nucleotide sequence as set
forth herein.

Additional ilvC nucleic acid sequences include those that comprise the
nucleotide sequence of SEQ ID NO:35, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:36 (e.g., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide
having the amino acid sequence as set forth in SEQ ID NO:36 and a substantially
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:35 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:36, or are complementary to an ilvC nucleotide
sequence as set forth herein.

Additional ilvD nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:37, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:38 (e.g., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide
having the amino acid sequence as set forth in SEQ ID NO:38 and a substantially
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the
amino acid sequence of SEQ ID NO:38, or are complementary to an ilvD nucleotide 
sequence as set forth herein. 

Additional alsS nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:86, encode a homologue of the polypeptide having
the amino acid sequence set forth in SEQ ID NO:87 (e.g., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:87 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:86 or
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:87, or are complementary to an alsS nucleotide 
sequence as set forth herein. 

In another embodiment, an isolated nucleic acid molecule is or includes a coaX
gene, or portion or fragment thereof. In one embodiment, an isolated coaX nucleic acid
molecule or gene comprises the nucleotide sequence as set forth in SEQ ID NO:19 (e.g.,
comprises the B. subtilis coaX nucleotide sequence). In another embodiment, an 
isolated coaX nucleic acid molecule or gene comprises a nucleotide sequence that 
encodes the amino acid sequence as set forth in SEQ ID NO:9 (e.g., encodes the B.
subtilis CoaX amino acid sequence). In yet another embodiment, an isolated coaX
nucleic acid molecule or gene encodes a homologue of the CoaX protein having the 
amino acid sequence of SEQ ID NO:9. As used herein, the term "homologue" includes 
a protein or polypeptide sharing at least about 30-35%, preferably at least about 35-40%, 
more preferably at least about 40-50%, and even more preferably at least about 60%,
70%, 80%, 90% or more identity with the amino acid sequence of a wild-type protein or
polypeptide described herein and having a substantially equivalent functional or 
biological activity as said wild-type protein or polypeptide. For example, a CoaX 
homologue shares at least about 30-35%, preferably at least about 35-40%, more 
preferably at least about 40-50%, and even more preferably at least about 60%, 70%,
80%, 90% or more identity with the protein having the amino acid sequence set forth as
SEQ ID NO:9 and has a substantially equivalent functional or biological activity (i.e., is 
a functional equivalent) of the protein having the amino acid sequence set forth as SEQ 
ID NO:9 (e.g., has a substantially equivalent pantothenate kinase activity). In a 
preferred embodiment, an isolated coaX nucleic acid molecule or gene comprises a
nucleotide sequence that encodes a polypeptide as set forth in any one of SEQ ID NO:7,
SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ
ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID
NO:74 or SEQ ID NO:75. In another embodiment, an isolated coaX nucleic acid
molecule hybridizes to all or a portion of a nucleic acid molecule having the nucleotide 
sequence set forth in SEQ ID NO:19 or hybridizes to all or a portion of a nucleic acid
molecule having a nucleotide sequence that encodes a polypeptide having the amino
acid sequence of any of SEQ ID NOs:7-18, 74 or 75. Such hybridization conditions are
known to those skilled in the art and can be found in Current Protocols in Molecular
Biology, Ausubel et al., eds., John Wiley & Sons, Inc. (1995), sections 2, 4 and 6.
Additional stringent conditions can be found in Molecular Cloning: A Laboratory
Manual, Sambrook et al., Cold Spring Harbor Press, Cold Spring Harbor, NY (1989),
chapters 7, 9 and 11. A preferred, non-limiting example of stringent hybridization
conditions includes hybridization in 4X sodium chloride/sodium citrate (SSC), at about 
65-70°C (or hybridization in 4X SSC plus 50% formamide at about 42-50°C) followed 
by one or more washes in 1X SSC, at about 65-70°C. A preferred, non-limiting
example of highly stringent hybridization conditions includes hybridization in 1X SSC,
at about 65-70°C (or hybridization in 1X SSC plus 50% formamide at about 42-50°C)
followed by one or more washes in 0.3X SSC, at about 65-70°C. A preferred, non-
limiting example of reduced stringency hybridization conditions includes hybridization
in 4X SSC, at about 50-60°C (or alternatively hybridization in 6X SSC plus 50% 
formamide at about 40-45°C) followed by one or more washes in 2X SSC, at about 50-
60°C. Ranges intermediate to the above-recited values, e.g., at 65-70°C or at 42-50°C
are also intended to be encompassed by the present invention. SSPE (1X SSPE is 0.15
M NaCl, 10 mM NaH2PO4, and 1.25 mM EDTA, pH 7.4) can be substituted for SSC (1X
SSC is 0.15 M NaCl and 15 mM sodium citrate) in the hybridization and wash buffers;
washes are performed for 15 minutes each after hybridization is complete. The
hybridization temperature for hybrids anticipated to be less than 50 base pairs in length
should be 5-10°C less than the melting temperature (Tm) of the hybrid, where Tm is
determined according to the following equations. For hybrids less than 18 base pairs in
length, Tm(°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and
49 base pairs in length, Tm(°C) = 81.5 + 16.6(log10[Na+]) + 0.41(%G+C) - (600/N),
where N is the number of bases in the hybrid, and [Na+] is the concentration of sodium
ions in the hybridization buffer ([Na+] for 1X SSC = 0.165 M). It will also be
recognized by the skilled practitioner that additional reagents may be added to 
hybridization and/or wash buffers to decrease non-specific hybridization of nucleic acid 
molecules to membranes, for example, nitrocellulose or nylon membranes, including but
not limited to blocking agents (e.g., BSA or salmon or herring sperm carrier DNA),
detergents (e.g., SDS), chelating agents (e.g., EDTA), Ficoll, PVP and the like. When
using nylon membranes, in particular, an additional preferred, non-limiting example of
stringent hybridization conditions is hybridization in 0.25-0.5 M NaH2PO4, 7% SDS at
about 65°C, followed by one or more washes at 0.02 M NaH2PO4, 1% SDS at 65°C, see
e.g., Church and Gilbert (1984) Proc. Natl. Acad. Sci. USA 81:1991-1995, (or,
alternatively, 0.2X SSC, 1% SDS). In another preferred embodiment, an isolated
nucleic acid molecule comprises a nucleotide sequence that is complementary to a coaX
nucleotide sequence as set forth herein (e.g., is the full complement of the nucleotide
sequence set forth as SEQ ID NO:19).
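
The two melting temperature equations recited above for short hybrids can be worked through in code. The sketch below is illustrative only: the function and constant names are hypothetical, the sodium concentration is scaled linearly from the recited value of 0.165 M for 1X SSC (an assumption for other SSC strengths), and the suggested hybridization temperature is simply the recited 5-10°C below Tm.

```python
import math

# Sodium ion concentration recited in the text for 1X SSC.
NA_MOLAR_PER_1X_SSC = 0.165


def _count(seq, bases):
    """Count how many characters of seq (case-insensitive) are in bases."""
    upper = seq.upper()
    return sum(upper.count(b) for b in bases)


def hybrid_tm(seq, ssc_multiple=1.0):
    """Tm in degrees C for a hybrid shorter than 50 bases, per the recited equations."""
    n = len(seq)
    at = _count(seq, "AT")
    gc = _count(seq, "GC")
    if n == 0 or at + gc != n:
        raise ValueError("sequence must be non-empty and contain only A, C, G, T")
    if n < 18:
        # Tm = 2(# of A + T bases) + 4(# of G + C bases)
        return float(2 * at + 4 * gc)
    if n <= 49:
        # Tm = 81.5 + 16.6(log10[Na+]) + 0.41(%G+C) - (600/N)
        na = NA_MOLAR_PER_1X_SSC * ssc_multiple
        gc_percent = 100.0 * gc / n
        return 81.5 + 16.6 * math.log10(na) + 0.41 * gc_percent - 600.0 / n
    raise ValueError("the recited equations apply only to hybrids under 50 bases")


def hybridization_temp_range(seq, ssc_multiple=1.0):
    """Suggested hybridization temperature range: 10 to 5 degrees C below Tm."""
    tm = hybrid_tm(seq, ssc_multiple)
    return (tm - 10.0, tm - 5.0)


if __name__ == "__main__":
    # Toy oligonucleotides, not taken from any SEQ ID NO.
    print(hybrid_tm("ATGCATGCATGC"))
    print(hybridization_temp_range("ATGC" * 5))
```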

In another preferred embodiment, an isolated nucleic acid molecule is or 
includes a coaA gene, for example, a Bacillus (e.g., B. subtilis) coaA gene, or portion or
fragment thereof. Exemplary isolated coaA nucleic acid molecules and/or genes include
(1) an isolated coaA nucleic acid molecule or gene comprising the nucleotide sequence 
as set forth in any one of SEQ ID NOs:20-22; (2) an isolated coaA nucleic acid molecule 
or gene comprising a nucleotide sequence that encodes the amino acid sequence as set 
forth in SEQ ID NO:3; (3) an isolated coaA nucleic acid molecule or gene comprising a
nucleotide sequence which encodes a CoaA homologue (e.g., a polypeptide having an
amino acid sequence at least about 30-35%, preferably at least about 35-40%, more 
preferably at least about 40-50%, and even more preferably at least about 60%, 70%, 
80%, 90% or more identical to the amino acid sequence set forth as SEQ ID NO:3 and 
having a substantially equivalent enzymatic activity); (4) an isolated coaA nucleic acid
molecule or gene comprising a nucleotide sequence that encodes a polypeptide as set
forth in any one of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5 or SEQ
ID NO:6; (5) an isolated nucleic acid molecule that hybridizes under stringent 
conditions to all or a portion of a nucleic acid molecule having the nucleotide sequence 
set forth in SEQ ID NO:20, SEQ ID NO:21 or SEQ ID NO:22 or hybridizes to all or a 

25 portion of a nucleic acid molecule having a nucleotide sequence that encodes a 
polypeptide having the amino acid sequence of SEQ ID NO:3; and (6) an isolated 
nucleic acid molecule comprising a nucleotide sequence that is complementary to a 
coaA nucleotide sequence as set forth herein (e.g., is the full complement of the 
nucleotide sequence set forth in SEQ ID NO:20, SEQ ID NO:21 or SEQ ID NO:22). 
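
By way of illustration only, the relationship between a nucleotide sequence and its full complement can be sketched in Python as follows. The function name and the short input sequence are hypothetical and are not taken from the sequence listing; ambiguity codes other than N are assumed to be absent.

```python
# Illustrative sketch only: names and the example sequence are assumptions,
# not sequences from the sequence listing.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand of a DNA sequence, written 5' to 3'."""
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGGCC"))  # GGCCAT
```

Applying the function twice returns the starting sequence, which is a convenient sanity check on any complement computed this way.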

A nucleic acid molecule of the present invention (e.g., a coaX nucleic acid
molecule or gene or a coaA nucleic acid molecule or gene) can be isolated using
standard molecular biology techniques and the sequence information provided herein.
For example, nucleic acid molecules can be isolated using standard hybridization and
cloning techniques (e.g., as described in Sambrook, J., Fritsch, E. F., and Maniatis, T.
Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory,
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) or can be isolated
by the polymerase chain reaction using synthetic oligonucleotide primers designed
based upon the coaX or coaA nucleotide sequences set forth herein, or flanking
sequences thereof. A nucleic acid of the invention (e.g., a coaX nucleic acid molecule or
gene or a coaA nucleic acid molecule or gene) can be amplified using cDNA, mRNA or,
alternatively, chromosomal DNA as a template and appropriate oligonucleotide primers
according to standard PCR amplification techniques.

Yet another embodiment of the present invention features mutant coaX and coaA 
nucleic acid molecules or genes. The phrase "mutant nucleic acid molecule" or "mutant 
gene" as used herein, includes a nucleic acid molecule or gene having a nucleotide 
sequence which includes at least one alteration (e.g., substitution, insertion, deletion) 
such that the polypeptide or protein that may be encoded by said mutant exhibits an
activity that differs from that of the polypeptide or protein encoded by the wild-type nucleic
acid molecule or gene. Preferably, a mutant nucleic acid molecule or mutant gene (e.g.,
a mutant coaA or coaX gene) encodes a polypeptide or protein having a reduced activity
(e.g., having a reduced pantothenate kinase activity) as compared to the polypeptide or
protein encoded by the wild-type nucleic acid molecule or gene, for example, when
assayed under similar conditions (e.g., assayed in microorganisms cultured at the same 
temperature). A mutant gene also can encode no polypeptide or have a reduced level of 
production of the wild-type polypeptide. 

As used herein, a "reduced activity" or "reduced enzymatic activity" is one that 
is at least 5% less than that of the polypeptide or protein encoded by the wild-type
nucleic acid molecule or gene, preferably at least 5-10% less, more preferably at least 
10-25% less and even more preferably at least 25-50%, 50-75% or 75-100% less than 
that of the polypeptide or protein encoded by the wild-type nucleic acid molecule or 
gene. Ranges intermediate to the above-recited values, e.g., 75-85%, 85-90%, 90-95%, 
are also intended to be encompassed by the present invention. As used herein, a 
"reduced activity" or "reduced enzymatic activity" also includes an activity that has 
been deleted or "knocked out" (e.g., approximately 100% less activity than that of the 
polypeptide or protein encoded by the wild-type nucleic acid molecule or gene). 
Activity can be determined according to any well accepted assay for measuring activity 
of a particular protein of interest. Activity can be measured or assayed directly, for 
example, measuring an activity of a protein isolated or purified from a cell. 
Alternatively, an activity can be measured or assayed within a cell or in an extracellular 
medium. For example, assaying for a mutant coaA gene or a mutant coaX gene (i.e.,
said mutant encoding a reduced pantothenate kinase activity) can be accomplished by
expressing the mutated gene in a microorganism, for example, a mutant microorganism
which expresses pantothenate kinase in a temperature-sensitive manner, and assaying the
mutant gene for the ability to complement a temperature-sensitive (Ts) mutant for
pantothenate kinase activity. A coaX mutant gene or coaA mutant gene that encodes a
"reduced pantothenate kinase activity" is one that complements the Ts mutant less 
effectively than, for example, a corresponding wild-type coaX gene or coaA gene. 
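
By way of illustration only, the "reduced activity" thresholds recited above can be expressed as a simple calculation. The sketch below assumes that mutant and wild-type activities have been measured in the same units under similar conditions; the function names and the default threshold argument are illustrative assumptions, and no particular assay is implied.

```python
# Illustrative sketch only: function names and inputs are assumptions.
def percent_reduction(mutant_activity: float, wild_type_activity: float) -> float:
    """Percent by which the mutant activity falls below the wild-type activity."""
    if wild_type_activity <= 0:
        raise ValueError("wild-type activity must be positive")
    return (wild_type_activity - mutant_activity) / wild_type_activity * 100.0


def is_reduced_activity(mutant_activity: float,
                        wild_type_activity: float,
                        threshold_pct: float = 5.0) -> bool:
    """True if the mutant activity is at least threshold_pct below wild type."""
    return percent_reduction(mutant_activity, wild_type_activity) >= threshold_pct


if __name__ == "__main__":
    print(percent_reduction(75.0, 100.0))    # 25.0
    print(is_reduced_activity(97.0, 100.0))  # False (about 3% less)
    print(is_reduced_activity(0.0, 100.0))   # True (a knocked-out activity)
```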

It will be appreciated by the skilled artisan that even a single substitution in a
nucleic acid or gene sequence (e.g., a base substitution that encodes an amino acid
change in the corresponding amino acid sequence) can dramatically affect the activity of
an encoded polypeptide or protein as compared to the corresponding wild-type
polypeptide or protein. A mutant nucleic acid or mutant gene (e.g., encoding a mutant
polypeptide or protein), as defined herein, is readily distinguishable from a nucleic acid
or gene encoding a protein homologue, as described above, in that a mutant nucleic acid
or mutant gene encodes a protein or polypeptide having an altered activity, optionally
observable as a different or distinct phenotype in a microorganism expressing said
mutant gene or nucleic acid or producing said mutant protein or polypeptide (i.e., a
mutant microorganism) as compared to a corresponding microorganism expressing the
wild-type gene or nucleic acid or producing the wild-type protein or polypeptide. By
contrast, a protein homologue has an identical or substantially similar activity,
optionally phenotypically indiscernible when produced in a microorganism, as
compared to a corresponding microorganism expressing the wild-type gene or nucleic
acid. Accordingly, it is not, for example, the degree of sequence identity between
nucleic acid molecules, genes, proteins or polypeptides that serves to distinguish between
homologues and mutants; rather, it is the activity of the encoded protein or polypeptide
that distinguishes between homologues and mutants: homologues having, for example,
low sequence identity (e.g., 30-50% sequence identity) yet having substantially
equivalent functional activities, and mutants, for example, sharing 99% sequence identity
yet having dramatically different or altered functional activities. Exemplary
homologues are set forth in Figure 20 (i.e., CoaA homologues) and in Figure 23 (i.e.,
CoaX homologues). Exemplary mutants are described in Examples XV and XVIII
herein.



VIII. Recombinant Nucleic Acid Molecules and Vectors

The present invention further features recombinant nucleic acid molecules (e.g., 
recombinant DNA molecules) that include nucleic acid molecules and/or genes 
described herein (e.g., isolated nucleic acid molecules and/or genes), preferably Bacillus 
genes, more preferably Bacillus subtilis genes, even more preferably Bacillus subtilis
pantothenate kinase genes (e.g., coaX genes or coaA genes), pantothenate biosynthetic
genes (e.g., genes encoding pantothenate biosynthetic enzymes, for example, panB
genes encoding ketopantoate hydroxymethyltransferase, panE genes encoding
ketopantoate reductase, panC genes encoding pantothenate synthetase, and/or panD
genes encoding aspartate-α-decarboxylase) and/or isoleucine-valine (ilv) biosynthetic
genes (e.g., ilvBN or alsS genes encoding acetohydroxyacid synthetase, ilvC genes
encoding acetohydroxyacid isomeroreductase and/or ilvD genes encoding dihydroxyacid
dehydratase).

The present invention further features vectors (e.g., recombinant vectors) that 
include nucleic acid molecules (e.g., isolated or recombinant nucleic acid molecules 
and/or genes) described herein. In particular, recombinant vectors are featured that 
include nucleic acid sequences that encode bacterial gene products as described herein,
preferably Bacillus gene products, more preferably Bacillus subtilis gene products, even
more preferably Bacillus subtilis pantothenate biosynthetic gene products (e.g.,
pantothenate biosynthetic enzymes, for example, ketopantoate
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase, and/or
aspartate-α-decarboxylase) and/or isoleucine-valine biosynthetic gene products (e.g.,
acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase and/or dihydroxyacid
dehydratase). 

The term "recombinant nucleic acid molecule" includes a nucleic acid molecule 
(e.g., a DNA molecule) that has been altered, modified or engineered such that it differs 
in nucleotide sequence from the native or natural nucleic acid molecule from which the
recombinant nucleic acid molecule was derived (e.g., by addition, deletion or
substitution of one or more nucleotides). Preferably, a recombinant nucleic acid 
molecule (e.g., a recombinant DNA molecule) includes an isolated nucleic acid 
molecule or gene of the present invention (e.g., an isolated coaX, coaA, pan or ilv gene) 
operably linked to regulatory sequences. 

The term "recombinant vector" includes a vector (e.g., plasmid, phage, phasmid,
virus, cosmid or other purified nucleic acid vector) that has been altered, modified or
engineered such that it contains greater, fewer or different nucleic acid sequences than
those included in the native or natural nucleic acid molecule from which the
recombinant vector was derived. Preferably, the recombinant vector includes a coaX,
coaA, pan or ilv gene or recombinant nucleic acid molecule including such coaX, coaA,
pan or ilv gene, operably linked to regulatory sequences, for example, promoter 
sequences, terminator sequences and/or artificial ribosome binding sites (RBSs), as 
defined herein. 

The phrase "operably linked to regulatory sequence(s)" means that the 
nucleotide sequence of the nucleic acid molecule or gene of interest is linked to the
regulatory sequence(s) in a manner which allows for expression (e.g., enhanced,
increased, constitutive, basal, attenuated, decreased or repressed expression) of the
nucleotide sequence, preferably expression of a gene product encoded by the nucleotide
sequence (e.g., when the recombinant nucleic acid molecule is included in a 
recombinant vector, as defined herein, and is introduced into a microorganism). 

The term "regulatory sequence" includes nucleic acid sequences which affect
(e.g., modulate or regulate) expression of other nucleic acid sequences. In one
embodiment, a regulatory sequence is included in a recombinant nucleic acid molecule
or recombinant vector in a similar or identical position and/or orientation relative to a
particular gene of interest as is observed for the regulatory sequence and gene of interest
as it appears in nature, e.g., in a native position and/or orientation. For example, a gene
of interest can be included in a recombinant nucleic acid molecule or recombinant vector
operably linked to a regulatory sequence which accompanies or is adjacent to the gene
of interest in the natural organism (e.g., operably linked to "native" regulatory
sequences, for example, to the "native" promoter). Alternatively, a gene of interest can
be included in a recombinant nucleic acid molecule or recombinant vector operably
linked to a regulatory sequence which accompanies or is adjacent to another (e.g., a
different) gene in the natural organism. Alternatively, a gene of interest can be included
in a recombinant nucleic acid molecule or recombinant vector operably linked to a
regulatory sequence from another organism. For example, regulatory sequences from
other microbes (e.g., other bacterial regulatory sequences, bacteriophage regulatory
sequences and the like) can be operably linked to a particular gene of interest.

In one embodiment, a regulatory sequence is a non-native or non-naturally- 
occurring sequence (e.g., a sequence which has been modified, mutated, substituted, 
derivatized or deleted, including sequences which are chemically synthesized). Preferred
regulatory sequences include promoters, enhancers, termination signals, anti-termination
signals and other expression control elements (e.g., sequences to which repressors or
inducers bind and/or binding sites for transcriptional and/or translational regulatory
proteins, for example, in the transcribed mRNA). Such regulatory sequences are
described, for example, in Sambrook, J., Fritsch, E. F., and Maniatis, T. Molecular
Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. Regulatory sequences include
those which direct constitutive expression of a nucleotide sequence in a microorganism
(e.g., constitutive promoters and strong constitutive promoters), those which direct
inducible expression of a nucleotide sequence in a microorganism (e.g., inducible
promoters, for example, xylose inducible promoters) and those which attenuate or
repress expression of a nucleotide sequence in a microorganism (e.g., attenuation signals
or repressor sequences). It is also within the scope of the present invention to regulate
expression of a gene of interest by removing or deleting regulatory sequences. For
example, sequences involved in the negative regulation of transcription can be removed
such that expression of a gene of interest is enhanced. 

In one embodiment, a recombinant nucleic acid molecule or recombinant vector
of the present invention includes a nucleic acid sequence or gene that encodes at least
one bacterial gene product (e.g., a pantothenate biosynthetic enzyme, an isoleucine-valine
biosynthetic enzyme, or a CoA biosynthetic enzyme, for example CoaA or
CoaX) operably linked to a promoter or promoter sequence. Preferred promoters of the
present invention include Bacillus promoters and/or bacteriophage promoters (e.g.,
bacteriophage which infect Bacillus). In one embodiment, a promoter is a Bacillus
promoter, preferably a strong Bacillus promoter (e.g., a promoter associated with a
biochemical housekeeping gene in Bacillus or a promoter associated with a glycolytic
pathway gene in Bacillus). In another embodiment, a promoter is a bacteriophage
promoter. In a preferred embodiment, the promoter is from the bacteriophage SP01. In
a particularly preferred embodiment, a promoter is selected from the group consisting of
P15, P26 or Pveg, for example, the promoters set forth in SEQ ID NO:39, SEQ ID NO:40
or SEQ ID NO:41. Additional preferred promoters include tef (the translational
elongation factor (TEF) promoter) and pyc (the pyruvate carboxylase (PYC) promoter),
which promote high level expression in Bacillus (e.g., Bacillus subtilis). Additional
preferred promoters, for example, for use in Gram positive microorganisms include, but
are not limited to, the amyE promoter or phage SP02 promoters. Additional preferred
promoters, for example, for use in Gram negative microorganisms include, but are not
limited to, tac, trp, tet, trp-tet, lpp, lac, lpp-lac, lacIq, T7, T5, T3, gal, trc, ara, SP6, λ-PR
or λ-PL.

In another embodiment, a recombinant nucleic acid molecule or recombinant
vector of the present invention includes a terminator sequence or terminator sequences
(e.g., transcription terminator sequences). The term "terminator sequences" includes
regulatory sequences which serve to terminate transcription of a gene. Terminator
sequences (or tandem transcription terminators) can further serve to stabilize mRNA
(e.g., by adding structure to mRNA), for example, against nucleases.

In yet another embodiment, a recombinant nucleic acid molecule or recombinant
vector of the present invention includes sequences which allow for detection of the
vector containing said sequences (i.e., detectable and/or selectable markers), for
example, sequences that overcome auxotrophic mutations, for example, ura3 or ilvE,
fluorescent markers, and/or colorimetric markers (e.g., lacZ/β-galactosidase), and/or
antibiotic resistance genes (e.g., amp or tet).

In yet another embodiment, a recombinant nucleic acid molecule or recombinant
vector of the present invention includes an artificial ribosome binding site (RBS). The
term "artificial ribosome binding site (RBS)" includes a site within an mRNA molecule
(e.g., coded within DNA) to which a ribosome binds (e.g., to initiate translation) which
differs from a native RBS (e.g., an RBS found in a naturally-occurring gene) by at least
one nucleotide. Preferred artificial RBSs include about 5-6, 7-8, 9-10, 11-12, 13-14,
15-16, 17-18, 19-20, 21-22, 23-24, 25-26, 27-28, 29-30 or more nucleotides of which about
1-2, 3-4, 5-6, 7-8, 9-10, 11-12, 13-15 or more differ from the native RBS (e.g., the
native RBS of a gene of interest). Preferably, nucleotides which differ are substituted
such that they are identical to one or more nucleotides of an ideal RBS (e.g., SEQ ID
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48), when
optimally aligned for comparisons. Artificial RBSs can be used to replace the
naturally-occurring or native RBS associated with a particular gene. Artificial RBSs preferably
increase translation of a particular gene. Preferred artificial RBSs (e.g., RBSs for
increasing the translation of panB, for example, of B. subtilis panB) are depicted in
Table IA (e.g., SEQ ID NO:49 and SEQ ID NO:50).

Table IA: Preferred panB Ribosome Binding Sites

        10        20
       AGAAAGGAGGTGA           ideal RBS (SEQ ID NO:44)
CCCTCT-AG-AAGGAGGAGAAAACATG    RBS1 (SEQ ID NO:49)
CCCTCT-AG--AGGAGGAGAAAACATG    RBS2 (SEQ ID NO:50)
TAAACAT-G--AGGAGGAGAAAACATG    panB native RBS (SEQ ID NO:42)

Additional preferred artificial RBSs (e.g., RBSs for increasing the translation of
panD, for example, of B. subtilis panD) are depicted in Table IB (e.g., SEQ ID NO:51,
SEQ ID NO:52, SEQ ID NO:53 and SEQ ID NO:54).

Table IB: Preferred panD Ribosome Binding Sites

        10        20
CTAGAAAAGGAGGAATTTAAATG        pAN423 RBS (SEQ ID NO:88)
TTAAGAAAGGAGGTGANNNNATG        ideal RBS (SEQ ID NO:45)
TTAGAAAGGAGGATTTAAATATG        new design A (SEQ ID NO:51)
TTAGAAAGGAGGTTTAATTAATG        new design B (SEQ ID NO:52)
TTAGAAAGGAGGTGATTTAAATG        new design C1 (SEQ ID NO:53)
TTAGAAAGGAGGTGTTTAAAATG        new design C2 (SEQ ID NO:54)
TTAGAAAGGAGGTGANNNNNATG        ideal RBS (SEQ ID NO:46)

Additional preferred artificial RBSs (e.g., RBSs for increasing the translation of
panD, for example, of B. subtilis panD) are depicted in Table 1C (e.g., SEQ ID NO:55,
SEQ ID NO:56 and SEQ ID NO:57). The predicted amino acid sequence at the
C-terminus of the PanC protein is shown. The start codon for PanD translation is
underlined.



Table 1C: Additional Preferred panD Ribosome Binding Sites

           10           20
            |            |
--- --A GAA AGG AGG TGA NNN NNN N ATG  ideal RBS (SEQ ID NO:47)
ATT CGA GAA ATG GAG AGA ATA TAA T ATG  native panD RBS (SEQ ID NO:43)
Ile Arg Glu Met Glu Arg Ile *     Met  SEQ ID NO:89

--- --A GAA AGG AGG TGA NNN NNN N ATG  ideal RBS (SEQ ID NO:47)
ATT CGA GAA AGG AGG TGA ATA TAA T ATG  NDI (SEQ ID NO:55)
Ile Arg Glu Arg Arg *             Met  SEQ ID NO:90

ATT CGA GAA AGG AGG TGA ATA ATA - ATG  NDII (SEQ ID NO:56)
Ile Arg Glu Arg Arg *             Met  SEQ ID NO:90

ATT CGT AGA AAG GAG GTG AAT TAA T ATG  NDIII (SEQ ID NO:57)
Ile Arg Arg Lys Glu Val Asn *     Met  SEQ ID NO:91
        AGA AAG GAG GTG ANN NNN N ATG  ideal RBS (SEQ ID NO:48)
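
By way of illustration only, the amino acid lines shown beneath the nucleotide sequences in Table 1C can be checked by translating each reading frame up to its first stop codon. The Python sketch below uses the standard genetic code and one-letter amino acid symbols; the function name is hypothetical, and the input strings are the codon rows as printed in the table (sequences containing N are not handled).

```python
# Illustrative sketch only: translate a reading frame with the standard genetic code.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping after a stop codon (*)."""
    seq = seq.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


if __name__ == "__main__":
    # Native panD RBS region: end of the PanC reading frame, then the PanD start codon.
    print(translate("ATT CGA GAA ATG GAG AGA ATA TAA T ATG"))  # IREMERI*
```

In one-letter code, IREMERI* corresponds to the Ile Arg Glu Met Glu Arg Ile * line of the table.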



Accordingly, in one embodiment, a vector of the present invention includes an
artificial RBS as set forth in SEQ ID NO:49 or SEQ ID NO:50. In another embodiment,
a vector of the present invention includes an artificial RBS as set forth in SEQ ID
NO:51, SEQ ID NO:52, SEQ ID NO:53 or SEQ ID NO:54. In yet another embodiment,
a vector of the present invention includes an artificial RBS as set forth in SEQ ID
NO:55, SEQ ID NO:56 or SEQ ID NO:57.
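
By way of illustration only, the definition of an artificial RBS given above (a site that differs from a native RBS by at least one nucleotide) can be sketched as a position-by-position comparison of two sequences that have already been aligned to equal length. The function names are hypothetical. The two input strings are transcribed from the native panD RBS row and, on the assumption that the split columns of Table 1C pair up in printed order, the NDI row.

```python
# Illustrative sketch only: names are assumptions; inputs are transcribed from Table 1C.
def count_differences(native: str, candidate: str) -> int:
    """Count mismatched positions between two pre-aligned, equal-length sequences."""
    if len(native) != len(candidate):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(native.upper(), candidate.upper()) if a != b)


def is_artificial_rbs(native: str, candidate: str) -> bool:
    """True if the candidate differs from the native RBS by at least one nucleotide."""
    return count_differences(native, candidate) >= 1


NATIVE_PAND = "ATT CGA GAA ATG GAG AGA ATA TAA T ATG".replace(" ", "")
NDI = "ATT CGA GAA AGG AGG TGA ATA TAA T ATG".replace(" ", "")

if __name__ == "__main__":
    print(count_differences(NATIVE_PAND, NDI))  # 4
    print(is_artificial_rbs(NATIVE_PAND, NDI))  # True
```

The comparison does not perform the alignment itself; sequences containing alignment gaps would need a rule for how a gap position is counted.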

In another embodiment, a recombinant vector of the present invention includes 
sequences that enhance replication in bacteria (e.g., replication-enhancing sequences). 
In one embodiment, replication-enhancing sequences are derived from E. coli. In 
another embodiment, replication-enhancing sequences are derived from pBR322 (e.g.,
sequences included within the pBR322-derived portion of any of the pAN vectors as set
forth in the Figures, i.e., the Not I-Not I sequences from about 5.0 kb to 9.0 kb of the
vector depicted in Figure 3A).

In yet another embodiment, a recombinant vector of the present invention
includes antibiotic resistance genes. The term "antibiotic resistance genes" includes
sequences which promote or confer resistance to antibiotics on the host organism (e.g.,
Bacillus). In one embodiment, the antibiotic resistance genes are selected from the
group consisting of cat (chloramphenicol resistance) genes, tet (tetracycline resistance)
genes, erm (erythromycin resistance) genes, neo (neomycin resistance) genes and spec
(spectinomycin resistance) genes. Recombinant vectors of the present invention can
further include homologous recombination sequences (e.g., sequences designed to allow
recombination of the gene of interest into the chromosome of the host organism). For
example, amyE sequences can be used as homology targets for recombination into the
host chromosome.

Preferred vectors of the present invention include, but are not limited to, vectors 
set forth in Figures 2-15, 17, 19, 22, 25 and 26. It will further be appreciated by one of 
skill in the art that the design of a vector can be tailored depending on such factors as the 
choice of microorganism to be genetically engineered, the level of expression of gene
product desired and the like. 

IX. Isolated Proteins 
Another aspect of the present invention features isolated proteins (e.g., isolated
pantothenate biosynthetic enzymes and/or valine-isoleucine biosynthetic enzymes and/or
isolated CoA biosynthetic enzymes, for example isolated CoaA or CoaX). In one
embodiment, proteins (e.g., isolated pantothenate biosynthetic enzymes and/or
valine-isoleucine biosynthetic enzymes and/or isolated CoA biosynthetic enzymes, for
example isolated CoaA or CoaX) are produced by recombinant DNA techniques and can
be isolated from microorganisms of the present invention by an appropriate purification
scheme using standard protein purification techniques. In another embodiment, proteins 
are synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein (e.g., an isolated or purified biosynthetic 
enzyme) is substantially free of cellular material or other contaminating proteins from
the microorganism from which the protein is derived, or substantially free from
chemical precursors or other chemicals when chemically synthesized. In one
embodiment, an isolated or purified protein has less than about 30% (by dry weight) of
contaminating protein or chemicals, more preferably less than about 20% of
contaminating protein or chemicals, still more preferably less than about 10% of
contaminating protein or chemicals, and most preferably less than about 5%
contaminating protein or chemicals. 

In a preferred embodiment, the protein or gene product is derived from Bacillus
(e.g., is Bacillus-derived). The term "derived from Bacillus" or "Bacillus-derived"
includes a protein or gene product which is encoded by a Bacillus gene. Preferably, the
gene product is derived from a microorganism selected from the group consisting of
Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, Bacillus firmus, Bacillus
pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus circulans, Bacillus
coagulans, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus
thuringiensis, and other Group 1 Bacillus species, for example, as characterized by 16S
rRNA type (Priest, supra). In another preferred embodiment, the protein or gene
product is derived from Bacillus brevis or Bacillus stearothermophilus. In another
preferred embodiment, the protein or gene product is derived from a microorganism
selected from the group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens,
Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus. In a particularly preferred
embodiment, the protein or gene product is derived from Bacillus subtilis (e.g., is
Bacillus subtilis-derived). The term "derived from Bacillus subtilis" or "Bacillus
subtilis-derived" includes a protein or gene product which is encoded by a Bacillus
subtilis gene. In yet another preferred embodiment, the protein or gene product is
encoded by a Bacillus gene homologue (e.g., a gene derived from a species distinct from
Bacillus but having significant homology to a Bacillus gene of the present invention, for
example, a Bacillus pan gene or Bacillus ilv gene).

Included within the scope of the present invention are bacterial-derived proteins
or gene products and/or Bacillus-derived proteins or gene products (e.g., B.
subtilis-derived gene products) that are encoded by naturally-occurring bacterial and/or Bacillus
genes (e.g., B. subtilis genes), for example, the genes identified by the present inventors,
for example, Bacillus or B. subtilis coaX genes, coaA genes, pan genes and/or ilv genes.
Further included within the scope of the present invention are bacterial-derived proteins
or gene products and/or Bacillus-derived proteins or gene products (e.g., B.
subtilis-derived gene products) that are encoded by bacterial and/or Bacillus genes (e.g., B. subtilis
genes) which differ from naturally-occurring bacterial and/or Bacillus genes (e.g., B.
subtilis genes), for example, genes which have nucleic acids that are mutated, inserted or
deleted, but which encode proteins substantially similar to the naturally-occurring gene
products of the present invention. For example, it is well understood that one of skill in
the art can mutate (e.g., substitute) nucleic acids which, due to the degeneracy of the
genetic code, encode an identical amino acid as that encoded by the
naturally-occurring gene. Moreover, it is well understood that one of skill in the art can mutate
(e.g., substitute) nucleic acids which encode conservative amino acid substitutions.
It is further well understood that one of skill in the art can substitute, add or delete
amino acids to a certain degree without substantially affecting the function of a gene
product as compared with a naturally-occurring gene product, each instance of which is
intended to be included within the scope of the present invention.

In a preferred embodiment, an isolated protein of the present invention (e.g., an
isolated pantothenate biosynthetic enzyme and/or an isolated isoleucine-valine
biosynthetic enzyme and/or an isolated CoA biosynthetic enzyme, for example
isolated CoaA or CoaX) has an amino acid sequence shown in SEQ ID NO:3, SEQ ID
NO:9, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID
NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID NO:87. In other
embodiments, an isolated protein of the present invention is a homologue of at least
one of the proteins set forth as SEQ ID NO:3, SEQ ID NO:9, SEQ ID NO:24, SEQ ID
NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID
NO:36, SEQ ID NO:38 or SEQ ID NO:87 (e.g., comprises an amino acid sequence at
least about 30-40% identical, preferably about 40-50% identical, more preferably about
50-60% identical, and even more preferably about 60-70%, 70-80%, 80-90%, 90-95% or
more identical to the amino acid sequence of SEQ ID NO:3, SEQ ID NO:9, SEQ ID
NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID
NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID NO:87, and has an activity that is
substantially similar to that of the protein having the amino acid sequence of SEQ
ID NO:3, SEQ ID NO:9, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID
NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID
NO:87, respectively).

To determine the percent homology of two amino acid sequences or of two
nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can
be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal
alignment with a second amino acid or nucleic acid sequence). When a position in the first
sequence is occupied by the same amino acid residue or nucleotide as the corresponding
position in the second sequence, then the molecules are identical at that position. The
percent identity between the two sequences is a function of the number of identical
positions shared by the sequences (i.e., % identity = # of identical positions/total # of
positions x 100), preferably taking into account the number of gaps and size of said gaps
necessary to produce an optimal alignment.
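
By way of illustration only, the percent identity formula recited above can be sketched in Python for two sequences that have already been aligned (e.g., by one of the programs described below). The function name and the example strings are hypothetical, and the sketch assumes that "total # of positions" means the number of alignment columns, including columns that contain a gap.

```python
# Illustrative sketch only: assumes pre-aligned inputs and counts gapped columns
# in the denominator; other conventions for the denominator exist.
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """% identity = identical positions / total aligned positions x 100."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    identical = sum(
        1 for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)


if __name__ == "__main__":
    # Ten alignment columns: eight identical, one mismatch, one gap.
    print(percent_identity("ACGTACGTAA", "ACGTTCGTA-"))  # 80.0
```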

The comparison of sequences and determination of percent homology between
two sequences can be accomplished using a mathematical algorithm. A preferred,
non-limiting example of a mathematical algorithm utilized for the comparison of sequences
is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-68,
modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such
an algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) of
Altschul et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be
performed with the NBLAST program, score = 100, wordlength = 12 to obtain
nucleotide sequences homologous to nucleic acid molecules of the invention. BLAST
protein searches can be performed with the XBLAST program, score = 50, wordlength =
3 to obtain amino acid sequences homologous to protein molecules of the invention. To
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as
described in Altschul et al. (1997) Nucleic Acids Research 25(17):3389-3402. When
utilizing BLAST and Gapped BLAST programs, the default parameters of the respective
programs (e.g., XBLAST and NBLAST) can be used. See http://www.ncbi.nlm.nih.gov.
Another preferred, non-limiting example of a mathematical algorithm utilized for the
comparison of sequences is the algorithm of Myers and Miller (1988) Comput. Appl.
Biosci. 4:11-17. Such an algorithm is incorporated into the ALIGN program available,
for example, at the GENESTREAM network server, IGH Montpellier, FRANCE
(http://vega.igh.cnrs.fr) or at the ISREC server (http://www.ch.embnet.org). When
utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used.

In another preferred embodiment, the percent homology between two amino acid
sequences can be determined using the GAP program in the GCG software package
(available at http://www.gcg.com), using either a Blosum 62 matrix or a PAM250
matrix, and a gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. In yet
another preferred embodiment, the percent homology between two nucleic acid
sequences can be determined using the GAP program in the GCG software package
(available at http://www.gcg.com), using a gap weight of 50 and a length weight of 3.

25 
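By way of non-limiting illustration, once two sequences have been aligned by any of
the above programs, a percent identity can be computed from the gapped alignment. The
following is a minimal sketch only and is not the GAP, ALIGN or BLAST implementation;
the function name percent_identity, the use of "-" as the gap character, and the use
of the full alignment length as the denominator are assumptions made for this example.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns that hold identical, non-gap residues.

    Both inputs must already be aligned (equal length, gaps marked with `gap`).
    The denominator is the alignment length, which is one of several
    conventions in use; it is an assumption of this sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap
    )
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Hypothetical six-column alignment with one gap and one mismatch.
    print(round(percent_identity("MKV-LA", "MKVQLG"), 1))
```

A different denominator (for example, the length of the shorter sequence) gives a
different percentage for the same alignment, so the convention should be stated
whenever such a value is reported.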

X. Biotransformations and Bioconversions

Another aspect of the present invention includes biotransformation processes
which feature recombinant microorganisms (e.g., mutant microorganisms) and/or
isolated CoA, pantothenate or isoleucine-valine biosynthetic enzymes described herein.
The term "biotransformation process", also referred to herein as "bioconversion
process", includes biological processes which result in the production (e.g.,
transformation or conversion) of any compound (e.g., intermediate or product) which is
upstream of a CoA, pantothenate or isoleucine-valine biosynthetic enzyme to a
compound (e.g., substrate, intermediate or product) which is downstream of said CoA,
pantothenate or isoleucine-valine biosynthetic enzyme.




In one embodiment, the invention features a biotransformation process for the
production of a panto-compound comprising contacting a microorganism which
overexpresses at least one pantothenate biosynthetic enzyme with at least one
appropriate substrate or precursor under conditions such that said panto-compound is
produced and recovering said panto-compound. In a preferred embodiment, the
invention features a biotransformation process for the production of pantoate comprising
contacting a microorganism which overexpresses ketopantoate reductase (the panE gene
product) with an appropriate substrate (e.g., ketopantoate) under conditions such that
pantoate is produced and recovering said pantoate. In another preferred embodiment,
the invention features a biotransformation process for the production of pantothenate
comprising contacting a microorganism which overexpresses ketopantoate reductase and
pantothenate synthetase with appropriate substrates (e.g., ketopantoate and β-alanine)
under conditions such that pantothenate is produced and recovering said pantothenate.
In yet another preferred embodiment, the invention features a biotransformation process
for the production of pantothenate comprising contacting a microorganism which
overexpresses ketopantoate hydroxymethyltransferase, ketopantoate reductase and
pantothenate synthetase with appropriate substrates (e.g., α-ketoisovalerate and
β-alanine) under conditions such that pantothenate is produced and recovering said
pantothenate. Preferred recombinant microorganisms for carrying out the above-
described biotransformations include pantothenate kinase mutants. Conditions under
which pantoate or pantothenate are produced can include any conditions which result in
the desired production of pantoate or pantothenate, respectively.

In yet another embodiment, the present invention includes a method of
producing β-alanine that includes culturing a microorganism which overexpresses
aspartate-α-decarboxylase under conditions such that β-alanine is produced. Preferably,
the aspartate-α-decarboxylase-overexpressing microorganism has a mutation in a nucleic
acid sequence encoding a pantothenate biosynthetic enzyme selected from the group
consisting of ketopantoate hydroxymethyltransferase, ketopantoate reductase and
pantothenate synthetase.

The invention further features a method of producing β-alanine that includes
contacting a composition comprising aspartate with an isolated Bacillus aspartate-α-
decarboxylase enzyme under conditions such that β-alanine is produced (e.g., an in vitro
synthesis method).

The microorganism(s) and/or enzymes used in the biotransformation reactions
are in a form allowing them to perform their intended function (e.g., producing a desired
compound). The microorganisms can be whole cells, or can be only those portions of
the cells necessary to obtain the desired end result. The microorganisms can be
suspended (e.g., in an appropriate solution such as buffered solutions or media), rinsed
(e.g., rinsed free of media from culturing the microorganism), acetone-dried,
immobilized (e.g., with polyacrylamide gel or κ-carrageenan or on synthetic supports,
for example, beads, matrices and the like), fixed, cross-linked or permeabilized (e.g.,
have permeabilized membranes and/or walls such that compounds, for example,
substrates, intermediates or products can more easily pass through said membrane or
wall).

Purified or unpurified CoA biosynthetic enzyme(s) (e.g., CoaA and/or CoaX),
pantothenate biosynthetic enzyme(s) and/or valine-isoleucine biosynthetic enzyme(s)
can also be used in biotransformation reactions. The enzyme can be in a form that
allows it to perform its intended function (e.g., obtaining the desired compound). For
example, the enzyme can be in free form or immobilized. Purified or unpurified CoA
biosynthetic enzyme(s), pantothenate biosynthetic enzyme(s) and/or valine-isoleucine
biosynthetic enzyme(s) can be contacted in one or more in vitro reactions with
appropriate substrate(s) such that the desired product is produced.

With respect to at least the above-described methodologies (e.g., the production
methodologies of the present invention), at least one aspect of the invention features the
following: embodiments in which the methods do not use microorganisms of the genus
Corynebacterium and/or microorganisms of the genus Escherichia; embodiments in
which the methods do not use microorganisms selected from the group consisting of
Escherichia coli and Corynebacterium glutamicum; embodiments in which the methods
do not use gram negative microorganisms; embodiments in which the microorganisms
utilized do not include, express or produce nucleic acid molecules, genes or proteins
(e.g., biosynthetic enzymes) derived from microorganisms of the genus
Corynebacterium and/or microorganisms of the genus Escherichia; embodiments in
which the microorganisms do not include, express or produce nucleic acid molecules,
genes or proteins (e.g., biosynthetic enzymes) derived from microorganisms selected
from the group consisting of Escherichia coli and Corynebacterium glutamicum.

XI. Screening Assays

Because CoA is an essential factor in bacteria, proteins (e.g., enzymes) involved
in the biosynthesis of CoA provide valuable tools in the search for novel antibiotics. In
particular, the CoaX protein is a valuable target for identifying bactericidal compounds
because it bears no resemblance in primary sequence to mammalian pantothenate kinase
enzymes. Accordingly, the present invention also provides a method (also referred to
herein as a "screening assay") for identifying modulators, i.e., candidate or test
compounds or agents (e.g., peptides, peptidomimetics, small molecules or other drugs)
which bind to CoaX, or have a stimulatory or inhibitory effect on, for example, coaX
expression or CoaX activity.

In one embodiment, the invention provides assays for screening candidate or test
compounds which are capable of binding to CoaX proteins or a biologically active
portion thereof. In another embodiment, the invention provides assays for screening
candidate or test compounds which modulate the activity of CoaX proteins or
biologically active portions thereof. The test compounds of the present invention can be
obtained using any of the numerous approaches in combinatorial library methods known
in the art, including: biological libraries; spatially addressable parallel solid phase or
solution phase libraries; synthetic library methods requiring deconvolution; the 'one-
bead one-compound' library method; and synthetic library methods using affinity
chromatography selection. The biological library approach is limited to peptide
libraries, while the other four approaches are applicable to peptide, non-peptide
oligomer or small molecule libraries of compounds (Lam, K.S. (1997) Anticancer Drug
Des. 12:145).

Examples of methods for the synthesis of molecular libraries can be found in the
art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et
al. (1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med.
Chem. 37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem.
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in
Gallop et al. (1994) J. Med. Chem. 37:1233. Libraries of compounds may be presented
in solution (e.g., Houghten (1992) Biotechniques 13:412-421), or on beads (Lam (1991)
Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria (Ladner USP
5,223,409), spores (Ladner USP '409), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci.
USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390); (Devlin
(1990) Science 249:404-406); (Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-
6382); (Felici (1991) J. Mol. Biol. 222:301-310); (Ladner supra.).

In one embodiment, an assay is a microorganism-based assay in which a
recombinant microorganism which expresses a CoaX protein or biologically active
portion thereof is contacted with a test compound and the ability of the test compound to
modulate CoaX activity is determined. Determining the ability of the test compound to
modulate CoaX activity can be accomplished by monitoring, for example, intracellular
phosphopantothenate or CoA concentrations or secreted pantothenate concentrations (as
compounds that inhibit CoaX will result in a buildup of pantothenate in the test
microorganism). CoaX substrate can be labeled with a radioisotope or enzymatic label
such that modulation of CoaX activity can be determined by detecting a conversion of
labeled substrate to intermediate or product. For example, CoaX substrates can be
labeled with 32P, 14C, or 3H, either directly or indirectly, and the radioisotope detected
by direct counting of radioemission or by scintillation counting. The ability of a
compound to modulate CoaX activity can alternatively be determined by
detecting the induction of a reporter gene (comprising a CoA-responsive regulatory
element operatively linked to a nucleic acid encoding a detectable marker, e.g.,
luciferase), or detecting a CoA-regulated cellular response.

In yet another embodiment, a screening assay of the present invention is a cell-
free assay in which a CoaX protein or biologically active portion thereof is contacted
with a test compound in vitro and the ability of the test compound to bind to or modulate
the activity of the CoaX protein or biologically active portion thereof is determined. In
a preferred embodiment, the assay includes contacting the CoaX protein or biologically
active portion thereof with known substrates to form an assay mixture, contacting the
assay mixture with a test compound, and determining the ability of the test compound to
modulate enzymatic activity of the CoaX on its substrates.

Screening assays can be accomplished in any vessel suitable for containing the
microorganisms, proteins, and/or reactants. Examples of such vessels include microtiter
plates, test tubes, and micro-centrifuge tubes. In more than one embodiment of the
above assay methods of the present invention, it may be desirable to immobilize either
CoaX protein or a recombinant microorganism expressing CoaX protein to facilitate
separation of products and/or substrates, as well as to accommodate automation of the
assay. For example, glutathione-S-transferase/CoaX fusion proteins can be adsorbed
onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione
derivatized microtiter plates. Other techniques for immobilizing proteins on matrices
(e.g., biotin-conjugation and streptavidin immobilization or antibody conjugation) can
also be used in the screening assays of the invention.

In another embodiment, modulators of CoaX expression are identified in a
method wherein a cell is contacted with a candidate compound and the expression of
coaX mRNA or CoaX polypeptide in the cell is determined. The level of expression in
the presence of the candidate compound is compared to the level of expression in the
absence of the candidate compound (or to a suitable control, for example, an appropriate
buffer control or standard). The candidate compound can then be identified as a
modulator of coaX mRNA or CoaX polypeptide expression based on this comparison.
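By way of non-limiting illustration, the comparison described above can be expressed
as a ratio of expression levels. The following is a sketch under stated assumptions:
the function name classify_modulator and the two-fold cutoff are hypothetical and are
not specified herein; any suitable cutoff or statistical test could be substituted.

```python
def classify_modulator(level_with_compound: float,
                       level_control: float,
                       fold_threshold: float = 2.0) -> str:
    """Compare expression with and without a candidate compound.

    `fold_threshold` is a hypothetical cutoff chosen for illustration only.
    Levels can be any non-negative measure of coaX mRNA or CoaX polypeptide,
    provided both values use the same units.
    """
    if level_control <= 0 or level_with_compound < 0:
        raise ValueError("control must be positive and levels non-negative")
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")
    ratio = level_with_compound / level_control
    if ratio >= fold_threshold:
        return "stimulator"
    if ratio <= 1.0 / fold_threshold:
        return "inhibitor"
    return "no effect"
```

In practice, replicate measurements and an appropriate buffer control or standard, as
noted above, would be used to set the cutoff.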

This invention further pertains to novel agents identified by the above-described
screening assays. Accordingly, it is within the scope of this invention to further use an
agent identified as described herein in an appropriate animal model. For example, a
CoaX modulating agent identified as described herein (e.g., a bactericidal
compound) can be used in an infectious animal model to determine the efficacy,
toxicity, or side effects of treatment with such an agent.

This invention is further illustrated by the following examples which should not
be construed as limiting. The contents of all references, patents, patent applications
(including U.S. Patent Application Serial No. 09/400,494, filed September 21, 1999
(pending), provisional U.S. Patent Application Serial No. 60/210,072, filed June 7,
2000, provisional U.S. Patent Application Serial No. 60/221,938, filed July 28, 2000 and
provisional U.S. Patent Application Serial No. 60/227,860, filed August 24, 2000, to
which this application relates) and published patent applications cited throughout this
application are incorporated herein by reference.
EXAMPLES

General Methodology: 

Strains. Bacillus subtilis strains of the present invention are generally derived
from either of two strains. The first is variously named "168", "1A1", or "RL-1". The
genotype is trpC2. This strain was derived from the wild type "Marburg" strain by
mutagenesis and has been the basis of much of the molecular biology work done on B.
subtilis. The second strain is PY79, a prototrophic derivative of 168 that was made Trp+
by transduction from the wild type strain W23.

Media. Standard minimal medium for B. subtilis is composed of 1 x Spizizen
salts and 0.5% glucose. Standard solid "rich medium" is Tryptone Blood Agar Broth
(Difco), and standard liquid "rich medium" is VY, a mixture of veal infusion broth and
yeast extract. For testing production of pantothenate in liquid test tube cultures, an
enriched form of VY, called "Special VY" or "SVY", is used. For batch fermentations,
SVYG and PFMG are used. The compositions of these media are given below.

VY, a rich liquid medium: 25 g Difco Veal Infusion Broth, 5 g Difco Yeast
Extract, 1 L water (autoclave).

TBAB, a rich solid medium: 33 g Difco Tryptone Blood Agar Broth, 1 L water
(autoclave).

MIN, a minimal medium: 100 ml 10 x Spizizen salts; 10 ml 50% glucose; 2 ml
10% arginine HCl*; 10 ml 0.8% tryptophan**; water to 1 liter. (*In some cases,
arginine is omitted or replaced by sodium glutamate at 0.04% final concentration. In
general, B. subtilis grows faster in minimal medium when certain amino acids, such as
arginine, glutamine, glutamate, or proline, are added as an auxiliary nitrogen source.)
(**For strains that are tryptophan auxotrophs, tryptophan is routinely added to most
minimal media.)
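By way of illustration, the final concentrations in 1 liter of MIN medium follow from
the stock concentrations and volumes listed above by the usual dilution relation
C1 x V1 = C2 x V2. The following sketch is offered for convenience only; the names
final_percent and MIN_STOCKS are assumptions of this example.

```python
def final_percent(stock_percent: float, stock_ml: float, total_ml: float) -> float:
    """Solve C1*V1 = C2*V2 for the final concentration C2 (in percent)."""
    if total_ml <= 0:
        raise ValueError("total volume must be positive")
    return stock_percent * stock_ml / total_ml


# (stock concentration in percent, volume of stock in ml) per liter of MIN medium,
# taken from the recipe in the text.
MIN_STOCKS = {
    "glucose": (50.0, 10.0),
    "arginine HCl": (10.0, 2.0),
    "tryptophan": (0.8, 10.0),
}

FINAL = {
    name: final_percent(conc, vol, 1000.0)
    for name, (conc, vol) in MIN_STOCKS.items()
}

if __name__ == "__main__":
    for name, pct in FINAL.items():
        print(f"{name}: {pct:.3f}% final")
```

The glucose value computed this way agrees with the 0.5% glucose stated for standard
minimal medium; likewise, 100 ml of 10 x Spizizen salts in 1 liter gives 1 x salts.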

10 x Spizizen Salts: 174 g K2HPO4·3H2O; 20 g (NH4)2SO4; 60 g KH2PO4; 10 g
Na3Citrate·2H2O; 2 g MgSO4·7H2O; water to 993 ml; then add 3.5 ml FeCl3 solution and
3.5 ml Trace Elements solution.

FeCl3 Solution: 4 g FeCl3·6H2O; 197 g Na3Citrate·2H2O; water to 1 liter (filter
sterilize).

Trace Elements Solution: 0.15 g Na2MoO4·2H2O; 2.5 g H3BO3; 0.7 g
CoCl2·6H2O; 0.25 g CuSO4·5H2O; 1.6 g MnCl2·4H2O; 0.3 g ZnSO4·7H2O; water to 1 liter
(filter sterilize).

SVY, Special VY, a supplemented* rich medium for testing pantothenate
production in test tube cultures: 25 g Difco Veal Infusion Broth; 5 g Difco yeast
extract; 5 g sodium glutamate; 2.7 g ammonium sulfate; 740 ml water (autoclave); add
200 ml 1 M potassium phosphate, pH 7.0; 60 ml 50% glucose. (*For testing
pantothenate production in liquid SVY test tube cultures, Na α-ketoisovalerate and/or
β-alanine can be added to a concentration of 5 g/L from filter-sterilized stocks.)

PFMG, a yeast extract based medium used in fermentors: 20 g Amberex
1003™ yeast extract; 5 g sodium glutamate; 2 g ammonium sulfate; 5 g tryptophan; 10 g
KH2PO4; 20 g K2HPO4·3H2O; 1 g MgCl2·6H2O; 0.1 g CaCl2·2H2O; 1 g sodium citrate;
0.01 g FeSO4·7H2O; 1 ml trace elements solution; 20 g glucose; add water to 1 L.
Glucose or other sugars are fed as needed. Feed solutions can contain minerals, defined
or food grade nutrients.

PF, a chemically defined pantothenate free medium for testing pantothenate
auxotrophy: 100 ml 10 x Spizizen Salts; 100 ml 1 x Difco Pantothenate Assay Medium;
10 ml 50% glucose; water to 1 liter.

For pantothenate auxotrophs, 1 mM Na pantothenate is added to both minimal
and rich media, since there is generally not enough pantothenate in rich media to support
B. subtilis pan mutants. Amino acids are at 100 mg per liter, when used.
Selection for antibiotic resistance is done with 5 mg/L chloramphenicol, 100 mg/L
spectinomycin HCl, 15 mg/L tetracycline HCl, or 1 mg/L erythromycin plus 25 mg/L
lincomycin.

Pantothenate Assays: Biological assay. The indicator organism, Lactobacillus
plantarum, requires pantothenate for growth, and responds to low concentrations (µg/L).
Thus, using serial dilutions, a wide range of concentrations can be assayed.
Commercially available medium (e.g., Pantothenate Assay Medium (PAM), Difco) can
be used. However, Difco PAM supplemented with pantothenate does not support
growth to the same level as obtainable using a fresh-mixed version of Pantothenate
Assay Medium (FM-PAM), made up of the individual components as specified by
Difco, which is, accordingly, routinely used instead of the commercial product.

Before assaying B. subtilis culture supernatants, the B. subtilis cells must be
either removed or killed. B. subtilis culture supernatants give approximately the same
pantothenate titer when the supernatants are autoclaved as when they are sterile filtered.
Accordingly, routine procedures involve autoclaving samples for 5 minutes prior to the
biological assay.

Pantothenate Assays: HPLC assay. Pantothenic acid production is measured
by HPLC with a detector wavelength of 197 nm and a reference at 450 nm. The
procedure is a modification of one recommended by Hewlett-Packard for water soluble
vitamins. Samples of culture broth are diluted into an equal volume of 60% acetonitrile
(ACN), centrifuged and filtered. Typically a further 10-fold dilution before analysis
brings the final dilution to 20-fold. Higher concentrations of product are diluted further.
Compounds are separated on a C18 Phenomenex 5µ Aqua 250 x 4.6 mm column with
5% acetonitrile (ACN) in 50 mM Na phosphate buffer at pH 2.5. An ACN gradient
from 5% to 95% washes the column between every sample. The area of the
pantothenate peak is proportional to the concentration between 5 and 1000 mg/L. Other
panto-compounds are also separated and quantitated by this method.
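By way of illustration, a broth concentration can be back-calculated from a peak area
and the dilution described above. The following is a sketch only: the calibration
slope (area_per_mg_l) is a hypothetical value that would be obtained from standards,
the function name is an assumption of this example, and it is further assumed that the
5 to 1000 mg/L proportional range refers to the concentration in the injected sample.

```python
LINEAR_RANGE_MG_L = (5.0, 1000.0)  # proportional range stated in the text


def pantothenate_mg_per_l(peak_area: float,
                          area_per_mg_l: float,
                          dilution_factor: float = 20.0) -> float:
    """Convert a pantothenate peak area to a concentration in the culture broth.

    area_per_mg_l   -- calibration slope from standards (hypothetical input)
    dilution_factor -- 2-fold into 60% ACN times a further 10-fold = 20-fold
    """
    if area_per_mg_l <= 0 or dilution_factor <= 0:
        raise ValueError("slope and dilution factor must be positive")
    injected = peak_area / area_per_mg_l
    low, high = LINEAR_RANGE_MG_L
    if not low <= injected <= high:
        # Outside the proportional range: re-run at a different dilution.
        raise ValueError(f"{injected:.1f} mg/L is outside {low}-{high} mg/L")
    return injected * dilution_factor
```

A sample that falls above the proportional range would be diluted further, as noted
above, and the larger dilution factor passed to the function.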

Amino Acid Analysis: HPLC assay. Amino acids present in the fermentation
medium and throughout the fermentation are measured by HPLC with a detector
wavelength of 338 nm and a reference at 390 nm. The procedure is a modification of
one recommended by Hewlett-Packard for amino acid analysis. Samples of culture
broth are prepared identically as for the panto-compound analysis. Compounds are
separated on a C18 Hypersil 5µ ODS 200 x 2.1 mm column. Solvent A is 20 mM Na
acetate buffer at pH 7.2. Solvent B contains 40% ACN and 40% methanol. A gradient
from 100% Solvent A to 100% Solvent B separates amino acids and washes the column
between every sample.

Batch Fermentations. Pantothenate producing strains are grown in stirred tank
fermentors, for example, in CF3000 Chemap 14 liter vessels with 10 liter working
volumes. Computer control and data collection are by commercial software, for example,
B. Braun Biotech MFCS software. Fermentations can be batch processes but are
preferably sugar-limited, fed batch processes. Some media components (e.g., of SVYG
and PFMG) are added to the fermentor and sterilized in place. Portions of the media are
sterilized separately and added to the fermentors aseptically. This procedure is well
known to those familiar with the art. Additional nitrogen sources in feeds are sterilized
separately and added to the carbon source after cooling.

The initial sugar in the medium is consumed in approximately 6 hours.
Afterwards, glucose or other sugars are fed with the possible addition of minerals, and
defined or food grade nutrients. Alternatively, feeds are scheduled based on a consensus
profile of nutritional requirements from samples taken from earlier fermentations.

After inoculation, agitation is set at a relatively low speed, e.g., 200 rpm. When
the dissolved oxygen (pO2) falls to 30%, computer control automatically adjusts the
agitation to maintain a dissolved oxygen concentration between 25 and 30% pO2.
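By way of non-limiting illustration, the agitation control described above can be
sketched as a simple dead-band rule. This is not the logic of the MFCS software or of
any particular controller; the function name, the step size of 25 rpm and the upper
limit of 1000 rpm are hypothetical values chosen for the example, while the 200 rpm
starting speed and the 25 to 30% pO2 band are taken from the text.

```python
def next_agitation_rpm(current_rpm: float,
                       po2_percent: float,
                       low: float = 25.0,
                       high: float = 30.0,
                       step_rpm: float = 25.0,   # hypothetical step size
                       min_rpm: float = 200.0,   # initial speed from the text
                       max_rpm: float = 1000.0   # hypothetical upper limit
                       ) -> float:
    """Return the agitation setpoint for the next control interval.

    Speed is raised when pO2 drops below the band, lowered when pO2 rises
    above it, and left unchanged inside the band; the result is clamped to
    the allowed range, so agitation never falls below the initial speed.
    """
    if po2_percent < low:
        rpm = current_rpm + step_rpm
    elif po2_percent > high:
        rpm = current_rpm - step_rpm
    else:
        rpm = current_rpm
    return max(min_rpm, min(max_rpm, rpm))
```

Early in a run, when pO2 is still well above 30%, the clamp keeps agitation at the
initial low speed, which matches the behavior described above.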



EXAMPLE I: Enhanced Production of a Panto-Compound Using Bacteria
Overexpressing panBCD Gene Products.

This Example describes the cloning of the B. subtilis panBCD operon and the 
generation of microorganisms overexpressing the panBCD gene products. 

To clone the B. subtilis panBCD operon, a plasmid library of B. subtilis GP275
(a derivative of 168) genomic DNA was transformed into E. coli BM4062 (birA(ts)) and
temperature resistant clones were selected at 42°C. By comparison of restriction maps
to the genome sequence, one particular clone was deduced to contain the B. subtilis birA
gene and the adjacent panBCD genes. This plasmid was named pAN201.

To overexpress the panBCD operon and produce pantothenate, the native
promoter of the panBCD operon was replaced by either of two strong, constitutive
promoters derived from the B. subtilis bacteriophage SPO1. These two promoters are
named P26 and P15. In addition, either of two artificial ribosome binding sites (RBSs)
was used to replace the native panB RBS. These two artificial RBSs (set forth as SEQ
ID NO:49 and SEQ ID NO:50) were predicted to increase translation of panBCD; their
sequences are shown in Table 1A. Three such engineered panBCD expression cassettes
were built into circular plasmids capable of replicating in E. coli. Other features of the
plasmids include a strong rho-independent transcription terminator from the E. coli
ribosomal RNA transcription unit, called T1T2, a Gram-positive chloramphenicol
resistance gene (cat), derived from pC194, and a pair of NotI restriction sites at the
junctions between the E. coli replicon and the segment intended for integration into B.
subtilis. Three plasmids of this series, pAN004, pAN005, and pAN006, were
constructed. pAN004 contains the P26 promoter, RBS1, and a low copy E. coli replicon.
pAN005 contains the P15 promoter, which in our experience is not as strong as P26,
RBS1, and the low copy replicon. pAN006 contains the P26 promoter, RBS2, and a
medium copy replicon.

The three panBCD expression cassettes contained in the above-mentioned three
plasmids were all ligated to a DNA fragment consisting of sequences that naturally
occur immediately upstream from the native panB gene and integrated in single copy by
homologous recombination into the panBCD locus of B. subtilis strains RL-1 and PY79,
replacing the wild-type operon. This was accomplished in two steps. First, a deletion-
substitution that replaced about two thirds of the panB coding region with a Gram-
positive spectinomycin resistance gene (spec) was integrated at panB to yield SpecR
pantothenate auxotrophs. These intermediate strains were then transformed with the
panBCD expression cassettes of pAN004, pAN005, and pAN006 after ligating them to a
DNA fragment containing chromosomal sequences just upstream of panB. Selection of
the incoming cassette was for pantothenate prototrophy. The resulting strains were
named PA221, PA222 and PA223 (from RL-1), and PA235, PA232 and PA233 (from
PY79), respectively. An example of a plasmid that contains the joined upstream
sequence present in the integrated strain PA221 is pAN240 (see Figure 2). The
nucleotide sequence of pAN240 is set forth as SEQ ID NO:76.

Polymerase chain reaction using appropriate primers was used to verify the
correct chromosomal structures of these engineered strains. When extracts of strain
PA221 were examined by SDS-PAGE, two proteins were found to be overexpressed.
One protein had an apparent molecular weight of 29,000 and the other protein appeared
to be 39,000 daltons. The 29,000 dalton band is presumably PanB (predicted molecular
weight of 29,761). The larger protein band presumably represents PanC (predicted size
31,960 daltons).

The ability of these strains to produce pantothenate in test tube cultures was
assessed as follows. Each strain was grown in SVY medium supplemented with 5 g/L α-
ketoisovalerate (α-KIV) and 5 g/L β-alanine, to ensure that these precursors were not
limiting. Culture supernatants were autoclaved and assayed using the bioassay.
Relative to the parent strains, RL-1 and PY79, the engineered strains produced about 8-
to 30-fold more pantothenate, attaining 1 g/L pantothenate in some cases.

Table 2. Production of pantothenate by engineered B. subtilis strains in liquid test
tube cultures grown in SVY medium with 5 g/L α-KIV and 5 g/L β-alanine.

Expt.   Strain   Promoter   RBS at panB   [pantothenate] mg/L
1       RL-1     Native     Native        30
        PA221    P26        RBS1          990, 790
        PA222    P15        RBS1          250, 250
        PA223    P26        RBS2          790, 790
2       PY79     Native     Native        40
        PA235    P26        RBS1          930, 860
        PA221    P26        RBS1          1100, 1030



The P26 promoter was about 3- to 4-fold more effective than the P15 promoter,
while RBS1 and RBS2 were roughly equivalent. Plasmids such as pAN004, pAN005,
pAN006 can also be recombined as circles into the B. subtilis wild type panBCD locus
by Campbell-type (single crossover) integration, selecting for chloramphenicol
resistance at 5 mg/L. Strains obtained in this fashion produce about the same amount of
pantothenate as strains PA221, PA222, and PA223, respectively. pAN004 containing
the P26 promoter, RBS1 and a low copy E. coli replicon, is depicted schematically in
Figure 3A. The nucleotide sequence of plasmid pAN004 is set forth as SEQ ID NO:93.
pAN006 containing the P26 promoter, RBS2 and a medium copy E. coli replicon, is
depicted schematically in Figure 3B. The nucleotide sequence of plasmid pAN006 is set
forth as SEQ ID NO:94. The nucleotide sequence of panBCD is set forth as SEQ ID
NO:59 and the predicted amino acid sequences of PanB, PanC and PanD are set forth as
SEQ ID NO:24, SEQ ID NO:26 and SEQ ID NO:28, respectively. Methods for
manipulating Bacilli are described, for example, in Harwood, C.R. and Cutting, S.M.
(editors), Molecular Biological Methods for Bacillus (1990) John Wiley & Sons, Ltd.,
Chichester, England, the content of which is incorporated herein by reference.
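By way of illustration, the fold increases discussed in connection with Table 2 can
be recomputed from the tabulated titers. The following sketch assumes that the two
values listed for a strain are duplicate cultures that may be averaged, and that each
strain is compared with the unengineered control grown in the same experiment; the
names TABLE_2, CONTROL and fold_over_control are assumptions of this example.

```python
from statistics import mean

# Titers in mg/L, keyed by experiment number and strain, as listed in Table 2.
TABLE_2 = {
    1: {"RL-1": [30], "PA221": [990, 790], "PA222": [250, 250], "PA223": [790, 790]},
    2: {"PY79": [40], "PA235": [930, 860], "PA221": [1100, 1030]},
}
# Unengineered control strain used in each experiment.
CONTROL = {1: "RL-1", 2: "PY79"}


def fold_over_control(expt: int, strain: str) -> float:
    """Mean titer of `strain` divided by the mean titer of that experiment's control."""
    control_titer = mean(TABLE_2[expt][CONTROL[expt]])
    return mean(TABLE_2[expt][strain]) / control_titer


def promoter_ratio() -> float:
    """Mean PA221 (P26, RBS1) titer over mean PA222 (P15, RBS1) titer, experiment 1."""
    return mean(TABLE_2[1]["PA221"]) / mean(TABLE_2[1]["PA222"])


if __name__ == "__main__":
    for strain in ("PA221", "PA222", "PA223"):
        print(strain, round(fold_over_control(1, strain), 1))
    print("P26 vs P15:", round(promoter_ratio(), 2))
```

On these assumptions the computed values fall within the approximately 8- to 30-fold
and 3- to 4-fold ranges stated in the text.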

EXAMPLE II: Enhanced Production of a Panto-Compound Using Bacteria
Overexpressing the panE1 Gene Product - Ketopantoate Reductase.

This Example describes the cloning of the B. subtilis panE1 gene and the
generation of microorganisms overexpressing the panE1 gene product.

Pan- B. subtilis strains (e.g., B. subtilis mutants blocked in the synthesis of
pantothenic acid) had previously been isolated, one of which was reported to be affected
in ketopantoate reductase activity (Baigori et al. (1991) J. Bacteriol. 173:4240-4242).
However, the mutations in these strains were incorrectly mapped to the purE-tre interval
of the B. subtilis genetic map, which does not contain the panE or panBCD genes.
Furthermore, as shown below, a panE mutant does not have a Pan- phenotype, as the ilvC
gene product can substitute for the panE gene product in B. subtilis as in other bacterial
strains such as E. coli. More recently, the S. typhimurium panE gene has been located
and determined to be allelic to apbA, a gene required for anaerobic purine biosynthesis
(Frodyma et al. (1998) J. Biol. Chem. 273:5572-5576). E. coli carries a highly
homologous gene at the same map location. Identification of the panE genes in E. coli
and S. typhimurium was complicated by the fact that the ilvC gene product,
acetohydroxy acid isomeroreductase, is also capable of carrying out the ketopantoate
reductase reaction. As a result, pantothenate auxotrophy is not obtained unless both
panE and ilvC are mutated.

To identify the B. subtilis panE1 gene, the B. subtilis genome was searched using
the protein sequence of E. coli or S. typhimurium ApbA (PanE), and two open reading
frames were identified having homology to ApbA, named ylbQ and ykpB. These genes
were renamed panE1 and panE2, due to their proposed function in pantothenate
biosynthesis. Both panE1 and panE2 were cloned as PCR products generated from
RL-1 genomic DNA as a template. Both genes were disrupted by either a
spectinomycin resistance gene (spec) or a chloramphenicol resistance gene (cat). The
interrupted genes were each integrated by double crossover into PY79 to give PA240
(ΔpanE1::spec) and PA241 (ΔpanE2::cat). Neither of these strains was a pantothenate
auxotroph when tested on pantothenate-free (PF) plates, although PA240 containing
ΔpanE1::spec grew slightly more slowly on TBAB without added pantothenate than
with a 1 mM pantothenate supplement. By comparison, a ΔpanB::spec strain does not
produce single colonies on TBAB, presumably because B. subtilis has no active uptake
system for pantothenate.

It was hypothesized that the B. subtilis gene, ilvC, could function for panE as had
been shown for E. coli. Accordingly, the panE1 and panE2 disruptions were introduced
into a strain, CU550, which is reported to be trpC2 ilvC4 leuC124. Both the single
panE1 and the double panE1, panE2 disruptants were pantothenate auxotrophs on PF
medium.

Table 3. Phenotypes of various panE1 and panE2 mutants on rich and defined
media.

                               Growth*:
Strain   Medium                - pan    + pan
PY79     TBAB                  +++      +++
         PF                    ++       ++
PA240    TBAB spec             +        +++
         PF                    ++       ++
PA241    TBAB cam              +++      +++
         PF                    ++       ++
CU550    TBAB                  +++      +++
         PF                    ++
PA256    TBAB spec                      +++
         PF                             ++
PA258    TBAB spec, cam                 +++
         PF

*Each "+" represents about 1 mm of colony diameter after overnight at 37°C.



Thus, mutating both panE1 and ilvC results in pantothenate auxotrophy, while
mutating only panE1 does not, similar to what has been reported for E. coli and S.
typhimurium.
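By way of illustration, the conclusion above can be checked against the growth
phenotypes on PF medium without pantothenate. The following sketch rests on stated
assumptions: the genotypes assigned to PA256 and PA258 are inferred from the CU550
background and the selection markers listed in Table 3, and the rule tested (auxotrophy
when both panE1 and ilvC are defective) is simply the conclusion of the preceding
paragraph. All names in the code are assumptions of this example.

```python
# (strain, panE1 disrupted, panE2 disrupted, ilvC mutant, grows on PF without pan)
# PA256 and PA258 genotypes are inferred, not stated explicitly in the text.
OBSERVED = [
    ("PY79",  False, False, False, True),
    ("PA240", True,  False, False, True),
    ("PA241", False, True,  False, True),
    ("CU550", False, False, True,  True),
    ("PA256", True,  False, True,  False),
    ("PA258", True,  True,  True,  False),
]


def predicted_auxotroph(pan_e1_disrupted: bool, ilvc_mutant: bool) -> bool:
    """Rule from the text: auxotrophy requires loss of both panE1 and ilvC."""
    return pan_e1_disrupted and ilvc_mutant


def rule_matches_observations() -> bool:
    """True if the rule agrees with every strain listed in OBSERVED."""
    return all(
        predicted_auxotroph(e1, ilvc) == (not grows)
        for _strain, e1, _e2, ilvc, grows in OBSERVED
    )


if __name__ == "__main__":
    print(rule_matches_observations())
```

The check covers only the strains listed; a panE2 disruption alone in an ilvC
background is not among them, so the rule makes no tested statement about that case.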

Next, the quantitative effect of panE1 and panE2 knockouts in a pantothenate
overproducing strain (PA235 described herein) was examined. The panE1 and panE2
disruptions were introduced into PA235, either singly or together, to produce PA245
(ΔpanE1::spec), PA248 (ΔpanE2::cat) and PA244 (ΔpanE1::cat, ΔpanE2::spec). The
effect of each mutation on pantothenate production was then tested in liquid test tube
cultures.
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Table 4. Pantothenate production by PA235 derivatives containing panEl and 
panE2 disruptions. 

Strain [pan] mg/L % of PA235 



PA235 990 (100) 

PA235 940 95 

PA245 59 6 

PA245 82 8 

PA248 1060 106 

PA248 1030 104 

PA244 25 3 

PA244 50 5 



Thus, deletion analysis indicated that the panE1 gene accounts for over 90% of
the pantothenate production, while deletion of panE2 did not have a significant effect on
pantothenate production. It is therefore concluded that panE1 accounts for most, but not
necessarily all, of the ketopantoate reductase activity in B. subtilis. The rest of the
ketopantoate reductase activity is predicted to be supplied by ilvC.
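
The arithmetic behind this conclusion can be sketched in a few lines of Python. The sketch is illustrative only: it averages the duplicate titers reported in Table 4 for each strain and expresses each mean as a fraction of the PA235 mean. The function name and the use of the PA235 mean (rather than a single PA235 culture) as the reference are assumptions made for this example and are not part of the original analysis.

```python
from statistics import mean

# Duplicate pantothenate titers (mg/L) transcribed from Table 4.
TABLE_4 = {
    "PA235": [990, 940],    # parent overproducing strain
    "PA245": [59, 82],      # panE1 disrupted
    "PA248": [1060, 1030],  # panE2 disrupted
    "PA244": [25, 50],      # panE1 and panE2 disrupted
}


def relative_titer(titers, reference="PA235"):
    """Return each strain's mean titer as a fraction of the reference mean."""
    ref = mean(titers[reference])
    return {strain: mean(values) / ref for strain, values in titers.items()}


if __name__ == "__main__":
    for strain, fraction in relative_titer(TABLE_4).items():
        print(f"{strain}: {fraction:.0%} of PA235")
```

Under these assumptions the panE1 disruptants retain well under 10% of the parental titer, while the panE2 disruptant is not reduced, which is the pattern described in the text.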

Having identified panE1 as an important gene for pantothenate production,
increased panE1 expression was tested to determine whether it could enhance
pantothenate production in strains such as PA221 or PA235. The panE1 coding
sequence was installed downstream of the P26 promoter and RBS2 in a vector, pOTP61,
designed to integrate and amplify at either the bpr locus (a non-essential protease gene)
or at the locus of the cloned insert. The resulting plasmid, pAN236 (Figure 4), was
transformed into PA221, selecting for resistance to tetracycline at 15 mg/L. The
nucleotide sequence of pAN236 is set forth as SEQ ID NO:77. One transformant,
named PA236, was chosen for further study.

PA236 was shown to overexpress a protein of about 31,000 daltons, which is
close to the expected molecular weight of 33,290 daltons for panE1 protein. Briefly,
whole cell extracts were prepared from PY79, RL-1, PA221, PA221/pOTP61 and PA236
(2 samples). Cell extracts were separated by gel electrophoresis and the gels were
Coomassie stained to visualize proteins. In cells engineered to overexpress panE
(PA236-1 and PA236-2), a band was visible having an approximate molecular weight of
~31,000 daltons (as compared to molecular weight markers). Moreover, PA221 and
PA236 expressed increased levels of a ~29,000 dalton band, corresponding to the panB
gene product, and a ~39,000 dalton band, presumably corresponding to the panC gene
product. Furthermore, E. coli transformed with pAN006 (Figure 3B) expressed bands
correlating to the panB and panC gene products, and E. coli transformed with pAN236
expressed a ~31,000 dalton band corresponding to the panE gene product.

Next, PA236 was compared to PA221 carrying the empty vector pOTP61 for
pantothenate production in liquid test tube cultures supplemented with 5 g/L β-alanine
and 5 g/L α-KIV.

Table 5. Effect of overexpression of panE1 and panE2 on pantothenate
production by engineered strains in liquid test tube cultures.

Strain    Additional    Gene             [Pantothenate]
          Plasmid       Overexpressed    mg/L
PA221     pOTP61        none             1,000
                                         940
PA236     pAN236        panE1            2,030
                                         2,050
PA238     pAN238        panE2            530
                                         680

Overexpression of panE1 caused a two-fold increase in pantothenate production
when compared to the parent strain (e.g., to slightly over 2 g/L) whereas overexpression
of panE2 resulted in a strain that produced about 35% less pantothenate than the parent
strain. The panE1 nucleotide sequence and predicted amino acid sequence are set forth
as SEQ ID NO:29 and SEQ ID NO:30.
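
As a rough check on the comparison drawn from Table 5, the following Python sketch computes the percent change of each strain's mean titer relative to the empty-vector control. It is a minimal illustration, assuming that the two values listed per strain are duplicate cultures and that averaging them is appropriate; the dictionary keys and function name are hypothetical.

```python
from statistics import mean

# Titers (mg/L) transcribed from Table 5; two cultures per strain.
TABLE_5 = {
    "PA221/pOTP61": [1000, 940],   # empty-vector control
    "PA236 (panE1)": [2030, 2050],
    "PA238 (panE2)": [530, 680],
}


def percent_change(titers, control="PA221/pOTP61"):
    """Percent change of each strain's mean titer relative to the control mean."""
    base = mean(titers[control])
    return {s: 100.0 * (mean(v) - base) / base for s, v in titers.items()}


if __name__ == "__main__":
    for strain, change in percent_change(TABLE_5).items():
        print(f"{strain}: {change:+.0f}%")
```

With these inputs the panE1 strain comes out at roughly double the control and the panE2 strain somewhat more than a third below it, in line with the statements above.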

EXAMPLE III: Enhanced Production of a Panto-Compound by Culturing
Bacteria Overexpressing panE1 or panBCD in the Presence of Valine.

The ability of valine to function as a media supplement (e.g., as a substitute for
α-KIV) in strains engineered to overexpress the panBCD operon and panE1 was
evaluated. Valine is closely related to α-KIV by transamination, is less expensive than
α-KIV, and is commercially available in kilogram quantities. Valine was substituted for
α-KIV in the standard liquid test tube cultures in SVY medium. The concentration of
valine was varied from 5 to 50 g/L. Although valine at 5 g/L was slightly less effective
than α-KIV in promoting pantothenate production, valine at 10 or 20 g/L equaled or
surpassed 5 g/L α-KIV in promoting pantothenate production.

EXAMPLES IV-X: Generation of Microorganisms Capable of Producing
Pantothenate in a Precursor-Independent Manner

B. subtilis strains such as PA221 and PA235 (engineered to overexpress
panBCD) and PA236 (engineered to overexpress panBCD and panE1) need to be fed
α-ketoisovalerate (α-KIV) (or valine) and aspartate (or β-alanine) to achieve maximal
pantothenate production, as both these precursors are limiting for pantothenate
synthesis. Accordingly, manipulated microorganisms were designed to eliminate the
need to feed limiting precursors of pantothenate biosynthesis in the production of
pantothenate. These strains are also useful in the production of various pantothenate
biosynthetic pathway intermediates.

EXAMPLE IV: Generation of Microorganisms Capable of Producing
Pantothenate in an Aspartate- (or β-Alanine) Independent Manner

The panD gene was cloned into B. subtilis expression vector pOTP61 to
construct pAN423 (Figure 5). The nucleotide sequence of pAN423 is set forth as SEQ
ID NO:78. The NotI restriction fragment containing panD was isolated from pAN423,
self-ligated and used to transform PA221. Transformants resistant to Tet¹⁵, Tet³⁰, and
Tet⁶⁰ were isolated and saved for further analysis.

Six of the pAN423 transformants plus two control transformants were grown in
SVY containing 5 g/l α-KIV with and without 10 g/l aspartate and then assayed for
pantothenate production (Table 6).

Table 6. Effect of overproducing PanD on pantothenate production with and without
added aspartate.

Culture*                  Asp         TetR**     OD550    [pan]
(PA221 transformants)     (10 g/L)    (µg/ml)             (mg/L)
pOTP61-1                  -           60         8.0      76
pOTP61-2                  -           60         7.7      91
423#1-1                   -           15         8.5      180
423#1-2                   -           15         8.0      150
423#1-3                   -           30         8.3      220
423#1-4                   -           30         8.5      280
423#1-5                   -           60         8.9      580
423#1-6                   -           60         8.8      280
pOTP61-1                  +           60         7.5      380
pOTP61-2                  +           60         6.9      560
423#1-1                   +           15         8.5      1200
423#1-2                   +           15         8.6      1000
423#1-3                   +           30         8.8      1200
423#1-4                   +           30         9.0      1200
423#1-5                   +           60         9.0      1200
423#1-6                   +           60         9.0      1200

*Test tube cultures were grown in SVY + α-KIV (5 g/L) with Asp (10 g/L) where
indicated.

**TetR = Approximate Tet-resistance of transformant



The pAN423 transformants produced at least twice the amount of pantothenate
as the controls (i.e., to a level at or near that which was obtained in earlier experiments
by the addition of β-alanine to the culture medium). The data also show that in the
absence of added aspartate, transformants containing additional copies of the panD gene
expression cassette produce more pantothenate than the control transformants. One of
the transformants, 423#1-5, produced about five times as much pantothenate as the
controls. These results indicated that increased levels of PanD protein "pull" the
conversion of available aspartate towards β-alanine, and that increasing panD gene
expression can result in enhancement of pantothenate production both in the presence
and absence of added aspartate.

Transformant 423#1-5 was re-named strain PA401 and studied further in shake
flask fermentations. The shake flask medium was SVY with maltose instead of SVY
with glucose. Results of shake flask experiments agreed well with test tube experiments
during the first 24 hours. In shake flask experiments without the addition of β-alanine,
PA401 produced approximately 1.5 g/l of pantothenate in 24 hours. Addition of
β-alanine to the culture medium did not further improve pantothenate titers (Table 7),
indicating that with this strain and these fermentation conditions, β-alanine is not
limiting pantothenate production. In fact, when no β-alanine is fed, one can observe that
PA401 is secreting β-alanine in significant amounts into the medium.

Table 7. Shake flask cultures with strain PA401 (panD) with and without β-alanine.

Initial       Amino acids (g/l)
β-ala         at 24 hours                              Pantothenate
added         β-ala      Val       pH       OD600      (g/l)
0             0.7        1.5       7.5      13.7       1.5
5 g/l         7.1        1.4       7.6      12.4       1.5

Each value represents the average of duplicate 250 ml baffled flasks containing 50 ml of
medium, incubated at 37°C with shaking (200 rpm).
Base Medium: SVY with 10 g/l α-KIV, 30 g/l maltose
2% Inoculum: SVY with Tet¹⁵ grown 24 hours.

EXAMPLE V: Engineering the panD gene for Further Increased Synthesis of
Aspartate Decarboxylase and Enhanced Production of Pantothenate

This Example describes the generation of improved ribosome binding sites
(RBSs) in the panD gene to increase the translation of panD mRNA.

Increasing the translation of the panD gene mRNA by generation of synthetic panD
RBSs

The RBS (SEQ ID NO:88) used to express panD in pAN423 is a synthetic RBS
and has been used to successfully produce other proteins in B. subtilis at a high level.
However, it contains six mismatches when aligned to the "ideal" B. subtilis RBS (SEQ
ID NO:45) (e.g., an RBS having a sequence which is complementary to the 16S rRNA
sequence within the B. subtilis ribosome). (See e.g., Table 1B, mismatches in bold).
Two new RBSs were designed to more closely mimic the ideal RBS. These synthetic
RBSs, named new design A (NDA) and new design B (NDB) (also referred to herein as
RBS3 and RBS4), are set forth as SEQ ID NO:51 and SEQ ID NO:52 and are aligned
with the ideal RBS in Table 1B.
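
The mismatch comparison described above amounts to a position-by-position comparison of two pre-aligned sequences. The Python sketch below shows one way such a count could be made. The two sequences in it are placeholders invented for illustration; they are NOT the sequences of SEQ ID NO:45, 51, 52 or 88, which are not reproduced in this section, and the function name is likewise hypothetical.

```python
def count_mismatches(candidate: str, ideal: str) -> int:
    """Count positions at which two pre-aligned, equal-length sequences differ."""
    if len(candidate) != len(ideal):
        raise ValueError("sequences must be pre-aligned to the same length")
    return sum(1 for a, b in zip(candidate.upper(), ideal.upper()) if a != b)


# Placeholder sequences for illustration only (not taken from the sequence listing).
IDEAL_EXAMPLE = "AGAAAGGAGGTGA"
CANDIDATE_EXAMPLE = "AGTAAGGAGCTGA"

if __name__ == "__main__":
    n = count_mismatches(CANDIDATE_EXAMPLE, IDEAL_EXAMPLE)
    print(f"{n} mismatch(es) relative to the placeholder ideal RBS")
```

A candidate RBS with fewer mismatches to the ideal sequence would, under the reasoning of this Example, be expected to pair more extensively with the 16S rRNA; the sketch does not model gaps or spacing to the start codon.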

Oligonucleotides corresponding to the top and bottom strands of each new RBS
were synthesized, annealed, then used to replace the RBS in pAN420, generating
plasmids pAN426 and pAN427. These constructions are illustrated in Figure 6. The
presence of the NDA and NDB RBS in pAN426 and pAN427 was confirmed by DNA
sequence analysis. Next, the panD genes from pAN426 and pAN427 were transferred
to B. subtilis expression vector pOTP61 as shown in Figure 7, creating pAN428 and
pAN429. The nucleotide sequence of pAN429 is set forth as SEQ ID NO:79.

NotI restriction fragments lacking the E. coli vector sequences were isolated
from pAN428 and pAN429, self-ligated, and used to transform strain PA221 to
resistance to Tet¹⁵. Four isolates resistant to Tet⁶⁰ were picked from each transformation
and assayed for pantothenate and β-alanine production along with PA221 transformed
with the empty vector (pOTP61) and PA221 transformed with pAN423 (strain PA401)
(see Table 8).



Table 8. Pantothenate production by test tube cultures of PA221 transformed
with pAN428 and pAN429

Plasmid        Medium Supplements    OD550    Pan (g/l)    β-Ala (g/l)
pOTP61         α-KIV⁵                10       UND          0.04
pAN423         α-KIV⁵                10       0.4          0.04
pAN428-1*      α-KIV⁵                12       0.6          0.04
pAN428-2       α-KIV⁵                11       0.5          0.03
pAN428-3       α-KIV⁵                11       0.3          0.03
pAN428-4       α-KIV⁵                10       0.1          UND
pAN429-1       α-KIV⁵                12       0.6          0.04
pAN429-2       α-KIV⁵                11       0.5          0.04
pAN429-3       α-KIV⁵                11       0.6          0.05
pAN429-4#      α-KIV⁵                12       0.8          0.10

pOTP61         α-KIV⁵ + Asp¹⁰        11       0.5          0.08
pAN423         α-KIV⁵ + Asp¹⁰        12       0.9          1.32
pAN428-1       α-KIV⁵ + Asp¹⁰        12       0.8          1.97
pAN428-2       α-KIV⁵ + Asp¹⁰        12       0.8          1.51
pAN428-3       α-KIV⁵ + Asp¹⁰        12       0.9          1.02
pAN428-4       α-KIV⁵ + Asp¹⁰        11       0.8          0.30
pAN429-1       α-KIV⁵ + Asp¹⁰        12       0.8          1.78
pAN429-2       α-KIV⁵ + Asp¹⁰        12       0.8          1.66
pAN429-3       α-KIV⁵ + Asp¹⁰        12       0.8          1.78
pAN429-4       α-KIV⁵ + Asp¹⁰        13       0.8          2.28

UND: Below the limits of detection. * Renamed PA402 # Renamed PA403



When grown in medium supplemented with α-KIV at 5 g/l (α-KIV⁵), the
pAN428-1 transformant and all four of the pAN429 transformants produced more
pantothenate than did PA401, suggesting that these transformants contain higher levels
of aspartate decarboxylase activity. When grown in medium supplemented with α-KIV⁵
and Asp¹⁰, none of the pAN428 or pAN429 transformants produced more pantothenate
than PA401. However, the pAN428-1 transformant and all four of the pAN429
transformants produced significantly more β-alanine than did PA401. It is possible that
the excess β-alanine produced from added aspartate causes inhibition of pantothenate
production. Alternatively, β-alanine may accumulate because pantoate is limiting in
these strains.

The strains that produced the highest level of β-alanine, the pAN428-1 and
pAN429-4 transformants, were renamed PA402 and PA403, respectively. These two
strains were grown in SVY medium supplemented with various intermediates and
reassayed for pantothenate and β-alanine production. PA221 and PA401 were included
as controls. The results of the assays are presented in Table 9.



Table 9. Pantothenate production of PA402 and PA403 in test tube cultures.

Strain    Medium Supplements     OD550    Pan (g/l)    β-Ala (g/l)    Val (g/l)
PA221     α-KIV⁵                 7.9      UND          UND            0.9
PA401     α-KIV⁵                 8.7      0.3          0.04           0.9
PA402     α-KIV⁵                 8.5      0.5          0.04           0.9
PA403     α-KIV⁵                 9.4      0.7          0.07           0.9

PA221     α-KIV⁵ + Asp¹⁰         9.8      0.4          0.11           0.8
PA401     α-KIV⁵ + Asp¹⁰         9.1      0.8          1.15           0.8
PA402     α-KIV⁵ + Asp¹⁰         9.4      0.8          2.02           0.8
PA403     α-KIV⁵ + Asp¹⁰         9.7      0.7          2.40           0.8

PA221     Pantoate⁵              8.9      UND          UND            0.2
PA401     Pantoate⁵              8.7      0.3          0.02           0.2
PA402     Pantoate⁵              10.6     0.5          0.02           0.2
PA403     Pantoate⁵              10.5     0.7          0.02           0.2

PA221     Pantoate⁵ + Asp¹⁰      9.5      0.4          0.06           0.2
PA401     Pantoate⁵ + Asp¹⁰      9.2      2.2          0.62           0.2
PA402     Pantoate⁵ + Asp¹⁰      9.1      2.8          1.17           0.2
PA403     Pantoate⁵ + Asp¹⁰      10.2     2.9          1.58           0.2

UND: Below the limits of detection.

When grown in medium supplemented with either α-KIV⁵ or Pantoate⁵, PA402
and PA403 produced significantly more pantothenate than did PA401. As before, even
though PA402 and PA403 produced significantly more β-alanine than PA401 when
grown in medium supplemented with α-KIV⁵ and Asp¹⁰, they did not produce a
proportional increase in pantothenate. However, when grown in medium supplemented
with Pantoate⁵ plus Asp¹⁰, both PA402 and PA403 produced significantly more
pantothenate than PA401, about a 30% increase.

It can be concluded from these experiments that the improved NDA and NDB
panD ribosome binding sites, engineered into pAN428 and pAN429, respectively, lead
to increased levels of aspartate decarboxylase activity.

Increasing the translation of the panD gene mRNA by generation of synthetic panD
RBSs within the panBCD operon

The native B. subtilis panD gene ribosome binding site (RBS) (SEQ ID NO:43),
which is found in the P26 panBCD operon cassette present in PA221 (and in other
engineered pantothenate production strains described herein), is shown in Table 1C
aligned with the ideal ribosome binding site (SEQ ID NO:47). The alignment shows
mismatches between the native B. subtilis panD gene RBS, which is located within the
coding sequence for PanC, and the ideal RBS. Three new RBSs (within the P26
panBCD operon cassette) were generated to increase translation of the panD gene
mRNA and to yield increased synthesis of aspartate decarboxylase. These synthetic
RBSs (termed NDI, NDII, and NDIII, also referred to herein as RBS5, RBS6 and RBS7,
respectively) are set forth as SEQ ID NO:55, SEQ ID NO:56 and SEQ ID NO:57,
respectively, and are included in Table 1C. It should be noted that although changes in
the panD RBS within the panBCD operon also change the C-terminal amino acid
sequence of the PanC protein encoded by that operon, an alignment of known and
suspected PanC protein amino acid sequences showed that the sequence of the last nine
amino acids of the B. subtilis PanC protein could be altered without affecting any
conserved amino acid residues, indicating that such changes should not reduce
pantothenate synthetase activity or expression. The new RBSs were synthesized and
incorporated into the P26 panBCD operon expression cassette as follows.

First, PCR primers were designed to contain the following elements: (1) a
nucleic acid sequence encoding the first five amino acids of PanD up to and including a
unique BsiWI restriction site that had been previously introduced into panD by PCR; (2)
a stop codon for panC; (3) at least one synthetic RBS; and (4) 30-39 bp of nucleic acid
sequence having 100% identity with panC upstream of the panD RBS. The primers
were named TP102, TP103, and TP104 and contain the NDI, NDII, and NDIII ribosome
binding sites, respectively. These three primers were used in conjunction with the 5'
primer TP101, which hybridizes near the start codon of panC, in three independent PCR
reactions to generate the NDI, NDII, and NDIII PCR products. The PCR products were
purified, digested with XbaI, then cloned into plasmid vector pASK-IBA3 which had
been digested with XbaI and SmaI. The resulting plasmids were named pAN431,
pAN432, and pAN433. The construction of pAN431 is illustrated in Figure 8 and is
representative of all three plasmid constructions. The presence of the desired synthetic
panD gene RBS in each new plasmid was confirmed by DNA sequencing.

Next, the modified panC genes containing the new panD RBSs were joined with
the panD gene utilizing the unique BsiWI restriction site. This was accomplished by
isolating the appropriate NsiI-BsiWI restriction fragments from pAN431, pAN432, and
pAN433 and ligating them with a 2395 bp NsiI-BsiWI restriction fragment from
pAN420, which supplied the BsiWI-modified panD gene. These constructions resulted
in plasmids pAN441, pAN442, and pAN443, respectively. A representative
construction (pAN441) is illustrated in Figure 9. The nucleotide sequence of pAN443 is
set forth as SEQ ID NO:80.

The new panD gene RBSs were then substituted into the P26 panBCD operon
expression cassette as follows. First, a deletion-insertion mutation which removes the
region of panC containing the panD RBS was created. This was constructed by
digesting pAN430 with a mixture of BspEI and BglII and recovering the 4235 bp
fragment which is now missing the 3' end of panC and the 5' end of panD. This
fragment was ligated with an AvaI-BamHI restriction fragment from plasmid pECC4,
which contains the chloramphenicol acetyl transferase (cat) gene. The 5' extension
produced by AvaI digestion is compatible with that produced by BspEI while the BglII
and BamHI extensions are also compatible. The resulting plasmid was named pAN440,
and its construction is illustrated in Figure 10.
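
The end compatibility relied on in this ligation can be illustrated with a short Python sketch that derives the 5' extension left by each enzyme from its recognition site and cut position. The recognition sites used (AvaI C^YCGRG, BspEI T^CCGGA, BglII A^GATCT, BamHI G^GATCC) are the standard published ones. AvaI recognizes a degenerate site, and the CCCGGG instance used below is an assumption, since the actual site sequence in pECC4 is not given in the text; the function names are hypothetical.

```python
# Recognition sites (5'->3', top strand) and the cut offset on the top strand.
# AvaI cuts the degenerate site C^YCGRG; the CCCGGG instance here is an
# assumption, because the sequence of the AvaI site in pECC4 is not stated.
ENZYMES = {
    "AvaI": ("CCCGGG", 1),
    "BspEI": ("TCCGGA", 1),
    "BglII": ("AGATCT", 1),
    "BamHI": ("GGATCC", 1),
}


def five_prime_overhang(site: str, cut: int) -> str:
    """Single-stranded 5' extension left by a symmetric cut in a palindromic site."""
    return site[cut:len(site) - cut]


def compatible(enzyme_a: str, enzyme_b: str) -> bool:
    """Return True when the two enzymes leave identical 5' extensions.

    Equality is a sufficient test here because every extension in this table
    is itself palindromic, so it equals its own reverse complement.
    """
    return five_prime_overhang(*ENZYMES[enzyme_a]) == five_prime_overhang(*ENZYMES[enzyme_b])


if __name__ == "__main__":
    for a, b in [("AvaI", "BspEI"), ("BglII", "BamHI"), ("AvaI", "BamHI")]:
        print(a, b, "compatible" if compatible(a, b) else "not compatible")
```

Note that an AvaI site of the form CTCGAG would leave a TCGA extension instead, which would not anneal to a BspEI end; the compatibility stated in the text therefore implies a CCCGGG-type AvaI site under this model.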

The resulting deletion-insertion mutation was crossed into the P26 panBCD
operon via homologous recombination by transforming PA221 with linearized pAN440
and selecting for resistance to chloramphenicol on Cam⁵ plates containing 1 mM
pantothenate. Several transformants were tested, and were all found to require 1 mM
pantothenate for growth, as expected. Two of these transformants were renamed
PA408A and PA408B and were assayed for pantothenate production. Neither strain
synthesized measurable quantities of pantothenate, even when grown in medium
containing pantoate and β-alanine at 5 g/l, indicating that the strains are deficient in
pantothenate synthetase activity. Next, the new panD RBSs were crossed into the P26
panBCD operon by transforming PA408 with linearized pAN441, pAN442, and
pAN443 plasmid DNA and selecting for growth on TBAB plates without pantothenate
supplementation. A transformation with linearized pAN430 (including the native panD
RBS) was included as a control and was expected to give rise to transformants identical
to PA221 described herein. Four isolates from each transformation were assayed for
pantothenate and β-alanine production in SVY medium supplemented with various
intermediates (Tables 10 and 11).

Table 10. Pantothenate production of PA410 - PA413 in test tube cultures.

Strain      RBS       Medium         OD550    Pan      β-Ala
                      Supplements             (g/l)    (g/l)
PA221       native    Pantoate⁵      11       UND      UND
PA410-1     native    Pantoate⁵      12       UND      UND
PA410-2               Pantoate⁵      12       UND      UND
PA410-3               Pantoate⁵      12       UND      UND
PA410-4               Pantoate⁵      12       UND      UND
PA411-1     NDI       Pantoate⁵      12       0.23     UND
PA411-2               Pantoate⁵      12       0.20     UND
PA411-3               Pantoate⁵      12       0.19     UND
PA411-4               Pantoate⁵      12       UND      UND
PA412-1     NDII      Pantoate⁵      12       UND      UND
PA412-2               Pantoate⁵      11       UND      UND
PA412-3               Pantoate⁵      13       0.18     UND
PA412-4               Pantoate⁵      12       0.18     UND
PA413-1     NDIII     Pantoate⁵      12       0.18     UND
PA413-2               Pantoate⁵      12       0.17     UND
PA413-3               Pantoate⁵      12       0.16     UND
PA413-4               Pantoate⁵      12       0.17     UND

UND: Below the limits of detection.




Table 11. Pantothenate production of PA410 - PA413 in test tube cultures.

Strain      RBS       Medium                 OD550    Pan      β-Ala
                      Supplements                     (g/l)    (g/l)
PA221       native    Pantoate⁵ + Asp¹⁰      11       0.3      UND
PA410-1     native    Pantoate⁵ + Asp¹⁰      12       0.4      UND
PA410-2               Pantoate⁵ + Asp¹⁰      12       0.4      UND
PA410-3               Pantoate⁵ + Asp¹⁰      12       0.4      UND
PA410-4               Pantoate⁵ + Asp¹⁰      12       0.4      UND
PA411-1     NDI       Pantoate⁵ + Asp¹⁰      13       1.7      0.4
PA411-2               Pantoate⁵ + Asp¹⁰      13       1.7      0.4
PA411-3               Pantoate⁵ + Asp¹⁰      13       1.8      0.3
PA411-4               Pantoate⁵ + Asp¹⁰      13       0.4      UND
PA412-1     NDII      Pantoate⁵ + Asp¹⁰      13       0.4      UND
PA412-2               Pantoate⁵ + Asp¹⁰      12       0.4      UND
PA412-3               Pantoate⁵ + Asp¹⁰      12       1.6      0.3
PA412-4               Pantoate⁵ + Asp¹⁰      12       1.5      0.2
PA413-1     NDIII     Pantoate⁵ + Asp¹⁰      13       1.6      0.3
PA413-2               Pantoate⁵ + Asp¹⁰      13       1.6      0.4
PA413-3               Pantoate⁵ + Asp¹⁰      13       1.7      0.4
PA413-4               Pantoate⁵ + Asp¹⁰      13       1.7      0.4

UND: Below the limits of detection.



As expected from previous experiments using PA221, none of the transformants
that contained the native panD RBS produced measurable quantities of pantothenate
when grown in medium supplemented with pantoate. However, nine of the twelve
transformants expected to contain modified panD RBSs produced significant quantities
of pantothenate (160-230 mg/l) under these conditions, indicating that they possess
elevated levels of aspartate decarboxylase activity. When grown in medium
supplemented with both pantoate and aspartate, these same nine transformants produced
approximately four times more pantothenate than those with the native panD RBS. In
addition, these nine transformants accumulated measurable quantities of β-alanine
(230-410 mg/l). All transformants produced roughly equivalent quantities of pantothenate
when grown in medium containing pantoate and β-alanine, demonstrating that each
contains a functional pantothenate synthetase.

These data demonstrate that the synthetic panD RBSs are about four times more
effective than the native panD RBS in directing translation of the panD gene mRNA and
evidence the utility of such synthetic RBSs in enhancing pantothenate production.
Additional approaches to increasing pantothenate production can include, for example,
increasing the half-life of the panD gene mRNA, increasing the strength of the promoter
for panD transcription and/or increasing the stability of the PanD protein.
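
The "nine of twelve" and "about four times" figures can be reproduced from the Table 11 titers with a short Python sketch. This is an illustration under stated assumptions: an isolate is counted as improved when its titer exceeds twice the native-RBS mean (a threshold chosen for this example, not one stated in the text), and the function names are hypothetical.

```python
from statistics import mean

# Pantothenate titers (g/l) from Table 11 (pantoate + aspartate), by panD RBS.
TABLE_11 = {
    "native": [0.4, 0.4, 0.4, 0.4],  # PA410-1 to PA410-4
    "NDI": [1.7, 1.7, 1.8, 0.4],     # PA411-1 to PA411-4
    "NDII": [0.4, 0.4, 1.6, 1.5],    # PA412-1 to PA412-4
    "NDIII": [1.6, 1.6, 1.7, 1.7],   # PA413-1 to PA413-4
}


def improved_isolates(table, factor=2.0):
    """Titers of synthetic-RBS isolates above `factor` times the native mean.

    The default factor of 2 is an assumed cutoff used only for illustration.
    """
    baseline = mean(table["native"])
    return [t for rbs, titers in table.items() if rbs != "native"
            for t in titers if t > factor * baseline]


def fold_over_native(table, factor=2.0):
    """Mean titer of the improved isolates divided by the native-RBS mean."""
    return mean(improved_isolates(table, factor)) / mean(table["native"])


if __name__ == "__main__":
    print(len(improved_isolates(TABLE_11)), "improved isolates")
    print(f"{fold_over_native(TABLE_11):.1f}-fold over native RBS")
```

The three synthetic-RBS isolates that fall below the cutoff behave like the native-RBS controls in both tables, which is consistent with their not having acquired the modified RBS.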


EXAMPLE VI: Construction of Strains Containing an Integrated P26 panE1
Cassette without an Antibiotic Resistance Gene

Example II describes the identification of the B. subtilis panE1 gene that encodes
the enzyme responsible for the majority of the ketopantoate reductase activity in B.
subtilis. PA236 (containing the pAN236 plasmid) produced about twice as much
pantothenate (2 g/l) as its parent strain, PA221 (1 g/l), in 24 hour SVY test tube cultures.
PA236 was presumed to contain an amplified (~3 copies) integrated pAN236 plasmid
based on selection for tetracycline resistance (the tetR gene product being encoded on
the pAN236 plasmid in addition to the P26 panE1 cassette). Also useful in the
methodologies of the present invention are strains that contain a single integrated
unamplifiable copy of P26 panE1 at the panE1 locus, for example, without an antibiotic
resistance gene in the strain. Such a strain was generated as follows.

A plasmid named pAN251 was derived from pAN236 by inserting additional
chromosomal sequences just upstream and just downstream from the P26 panE1 cassette.
These additional sequences, which provide homology to allow integration of the P26
panE1 cassette at panE1 by double crossover, were obtained by PCR from chromosomal
DNA as a template. pAN251 is shown in Figure 11. The nucleotide sequence of
pAN251 is set forth as SEQ ID NO:81.

Next, a strain was constructed which allowed selection for the incoming P26
panE1 cassette. The strain included the following three components: (1) P26 panBCD;
(2) ΔpanE1; and (3) ilvC, since both panE1 and ilvC must be mutated to have a Pan⁻
phenotype. The starting strain was CU550 (trpC2, ilvC4, leuC124). The P26 panBCD
cassette from PA221 chromosomal DNA was introduced in two steps to create strain
PA290. Next, ΔpanE1::spec was transformed into PA290, using chromosomal DNA
from strain PA240, to give strain PA294 (trpC2, ilvC4, leuC124, P26 panBCD,
ΔpanE1::spec), which is a strict pantothenate auxotroph. Finally, PA294 was
transformed with plasmid pAN251, selecting for pantothenate prototrophy, to give strain
PA303. This strain was expected to have the genotype trpC2, ilvC4, leuC124, P26
panBCD, P26 panE1. PA303 was checked for the correct chromosomal structure at the
panE1 locus by PCR using primers that flank the P26 insertion just upstream of panE1.
The PCR product from PA303 was of the expected size, with a concomitant loss of the
PCR product from the wild type panE1 gene, consistent with having obtained the
desired double crossover event. Furthermore, PA303 was tetracycline sensitive, which
is also consistent with the desired double crossover event, as opposed to a
Campbell-type single crossover of the plasmid into the chromosome. The trp, ilv, and leu
auxotrophies from the parent strain were all maintained in PA303.

In 24 hour liquid SVY test tube cultures, PA303 produced almost the same level
of pantothenate as positive control PA236, and about twice as much as PA221, which
does not contain engineered panE1, as indicated in Table 12.

Table 12. Pantothenate production by 24 hr. test tube cultures of PA303 and
controls grown in SVY plus 5 g/l α-KIV and 5 g/l β-alanine.

Strain      OD600    [pan] g/l
PA221-1     10.9     0.85
PA221-2     10.5     0.85
PA236-1     9.5      1.74
PA236-2     9.3      1.70
PA303-1     10.8     1.66
PA303-2     10.7     1.61



EXAMPLE VII: Generation of Microorganisms Capable of Producing
Pantothenate in an α-KIV (or Valine) Independent Manner

α-Ketoisovalerate (α-KIV) is a rate limiting intermediate for pantothenate
production in certain strains deregulated for pantothenate synthesis. Addition of either
α-KIV or valine at 5 g/l increases pantothenate production about 5-fold in test tube
cultures with strains such as PA221. In order to alleviate the need to feed either α-KIV
or valine, strains were engineered that have an increased capacity to synthesize α-KIV.

α-KIV is produced in B. subtilis from pyruvate by the sequential action of three
enzymes encoded by four genes, ilvB and ilvN, ilvC, and ilvD. In a wild type B. subtilis,
three of the genes (ilvB, ilvN, and ilvC) are the first three genes of the large ilv-leu
operon. The fourth gene necessary for α-KIV synthesis, ilvD, is located by itself
elsewhere on the chromosome. The B. subtilis ilv-leu operon is thought to be regulated
only by leucine levels. Feeding of exogenous leucine reduces transcription of the ilv-leu
operon by about 13-fold, probably by an attenuation mechanism (Grandoni et al. (1992)
J. Bacteriol. 174: 3212-3219). The only known feedback regulation in the ilv-leu
pathway is the inhibition of the leuA gene product by leucine.

As a first step to deregulate the synthesis of α-KIV, a copy of the ilvBNC region
from the wild type B. subtilis ilv-leu operon was isolated by PCR, and installed adjacent
to the P26 promoter and RBS2 on a vector, pOLL8, that was designed to integrate a
single P26 expression cassette by double recombination at the amyE locus. The amyE
gene encodes a nonessential α-amylase, and is a useful locus for installing expression
cassettes. The resulting plasmid, pAN267, is illustrated in Figure 12. The nucleotide
sequence of pAN267 is set forth as SEQ ID NO:82. pAN267 readily gave stable
transformants by double crossover at the amyE locus of B. subtilis strains, as described
in detail below.

Construction of pantothenate overproducing strains that are leucine prototrophs

Initially, a B. subtilis strain containing ilvC4 and ΔpanE1 was used to introduce a
single copy of P26 panE1 into the chromosome without using an antibiotic resistance
gene. The double mutant was required to select for the incoming P26 panE1 cassette
because a ΔpanE1 mutation alone does not result in pantothenate auxotrophy. A strain
named CU550 was obtained containing ilvC4 to be used as a basis for this type of strain
construction. However, CU550 also contains a closely linked leuC124 mutation, so all
strains derived from CU550 required leucine. Having shown that the combination of P26
panBCD and P26 panE1 was favorable for pantothenate production, the next step was to
reassemble this combination of two cassettes in a leucine prototroph.

Accordingly, the two cassettes were combined in two different strain
backgrounds, RL-1 and PY79. To introduce chromosomal P26 panE1 into the PY79 and
RL-1 strain backgrounds without using an antibiotic resistance gene, a strategy was used
that did not rely on ilvC4. (The strategy took advantage of the observation that the
ΔpanE1 mutation causes a pantothenate bradytrophy, manifested by relatively small
colonies on TBAB (rich) plates.) First, ΔpanB::cat and ΔpanE::spec were introduced
into both strain backgrounds. Next, the resulting strains were transformed
simultaneously with DNA from two strains, PA221 (P26 panBCD) and PA303 (P26
panE1), selecting for Pan⁺ on TBAB plates. Colonies of two distinct sizes grew on the
selective plates, with the larger size comprising about 2% of the colonies. The larger
colonies were presumed to represent co-transformants that received both P26 panBCD
and P26 panE1, and the smaller colonies were presumed to have received only P26
panBCD. Consistent with this prediction, the larger colonies had lost both Camʳ and
Specʳ, while the smaller colonies had lost only the cat gene, and retained the spec gene.
Furthermore, a representative derivative of PY79, named PA327, and a representative
derivative of RL-1, named PA328, both produced elevated levels of pantothenate in test
tube cultures, about 1.6 to 1.7 g/l (Table 13).

Table 13. Pantothenate production of PA327, PA328, and controls from 24 hr
test tube cultures grown in SVY plus 5 g/l α-KIV and β-alanine.

Strain     Background   P26 panE1 copy number   [pan] g/l
PA221-1    RL-1         0                       0.92
PA221-2    RL-1         0                       0.95
PA236-1    RL-1         amplified (~3)          1.60
PA236-2    RL-1         amplified (~3)          1.73
PA327-1    PY79         1                       1.66
PA327-2    PY79         1                       1.65
PA328-1    RL-1         1                       1.61
PA328-2    RL-1         1                       1.91

Thus, PA327 and PA328 were concluded to contain both P26 panBCD and P26
panE1, and were used for further constructions as described below. PCR analysis
confirmed the presence of the two cassettes.

Installation of a stable P26 ilvBNC cassette into two lineages of pantothenate
overproducing strains

Having constructed PA327 and PA328, derivatives of PY79 and RL-1 that
contain P26 panBCD and P26 panE1, and that are Leu+, the next step was to introduce
stable copies of P26 ilvBNC. This was accomplished by transforming PA327 and
PA328 with plasmid pAN267, selecting for Spec^r. Screening by PCR showed that about
85% of the transformants obtained contained P26 ilvBNC integrated at amyE by double
crossover. One transformant of PA327, named PA340, and one transformant of PA328,
named PA342, were chosen for further study.

In test tube cultures grown in SVY medium plus 5 g/l β-alanine but without
added α-KIV, both PA340 and PA342 gave the expected increase in pantothenate
production over that of PA327 and PA328, to about 1.3 to 2 g/l (Table 14).

Table 14. Pantothenate and valine production by PA340 and PA342, both
containing P26 ilvBNC, in 24 hr test tube cultures grown in SVY with 5 g/l
β-alanine and with or without 5 g/l α-KIV.

                         OD600              [pan] g/l          [val] g/l
Strain     Background    -α-KIV   +α-KIV    -α-KIV   +α-KIV    -α-KIV   +α-KIV
PA340-1    PY79          118      7.1       2.02     2.10      0.38     0.90
PA340-2    PY79          10.3     7.5       1.97     2.03      0.40     0.91
PA342-1    RL-1          10.2     8.0       1.29     1.89      0.27     0.78
PA342-2    RL-1          9.6      9.2       1.34     2.04      0.21     0.79



The two new strains also gave a slight increase in valine secretion, indicating
that the ilvBNC genes had been deregulated. However, when the same strains were
grown with 5 g/l α-KIV added, a further increase in pantothenate production occurred
from PA342, suggesting that α-KIV was still rate limiting in this strain background.
Similar results, only with more growth and hence higher pantothenate levels, were seen
in shake flask cultures (Table 15).

Table 15. Pantothenate and valine production by PA340 and PA342, both
containing P26 ilvBNC, in 24 hour shake flask cultures grown in SVY with
5 g/l β-alanine and with or without 5 g/l α-KIV.

                         OD600              [pan] g/l          [val] g/l
Strain     Background    -α-KIV   +α-KIV    -α-KIV   +α-KIV    -α-KIV   +α-KIV
PA327      PY79          21       22        0.6      3.0       0.5      1.3
PA340-1    PY79          20       20        3.5      4.1       1.0      1.9
PA340-2    PY79          22       19        3.0      2.1       0.8      1.4
PA328      RL-1          20       16        1.4      2.7       0.6      1.3
PA342-1    RL-1          17       16        3.3      3.6       0.9      1.6
PA342-2    RL-1          18       18        3.1      4.2       0.8      1.4

EXAMPLE VIII: Increasing panD Copy Number in Strains Engineered to
Overproduce panE1 and the ilvBNC Gene Products Enhances Pantothenate
Production

Experiments in which β-alanine was fed to cultures of engineered B. subtilis strains
consistently showed that β-alanine was a rate limiting intermediate in pantothenate
synthesis. The effect of adding additional copies of panD on pantothenate production in
PA340 and PA342 was therefore examined. Strains PA340 and PA342 were transformed with
chromosomal DNA isolated from PA401 with selection on plates containing 15 µg/ml of
tetracycline (Tet15 plates). Transformants derived from each parent were patched onto
Tet60 plates to identify those which were likely to contain multiple copies of the
expression cassette. Twelve transformants from each transformation which grew on
Tet60 were streaked for single colonies on this medium and then assayed in SVY
medium test tube cultures for pantothenate production. One transformant from each
group was found to produce greater than 300 mg/l pantothenate in 24 hours. These two
transformants were saved and named PA404 (PA340 strain background) and PA405
(PA342 strain background). Both strains were resistant to spectinomycin, indicating that
the P26 ilvBNC expression cassette was still present at amyE. PCR analysis of
chromosomal DNA isolated from each strain confirmed that the deregulated panE1 gene
had also been retained.

Next, PA404 and PA405 were evaluated in shake flask cultures which were
grown in SVY medium containing maltose as the carbon source and supplemented with
various intermediates. The cultures were grown for 24 and 48 hours and then assayed
for pantothenate, β-alanine, and valine production. The results of this experiment are
presented in Table 16. Analogous shake flask culture data for the parent strains (PA340
and PA342) are included in the table for comparison.

Table 16. Pantothenate production by PA404 and PA405 in shake flask cultures
after 24 hours

Strain     Medium supplements   OD600   Pan g/l   β-Ala g/l   Val g/l
PA340      none                 20      0.4       <0.1        1.0
PA404      none                 22      1.8       <0.1        0.7
PA342      none                 19      0.3       0.2         0.7
PA405      none                 19      1.4       0.4         0.5
PA340      β-alanine            18      3.6       3.2         0.6
PA404      β-alanine            18      2.8       5.1         0.7
PA342*     β-alanine            17      3.3       3.3         0.5
PA405*     β-alanine            19      1.3       6.5         0.6

* Values are the average ...



In the absence of any medium supplementation, PA404 and PA405 made four to
five times more pantothenate in 24 hours than their isogenic parent strains
(Table 16). The supply of β-alanine was clearly limiting in the parent strains PA340 and
PA342. Addition of amplified P26 panD greatly increased the supply of β-alanine.
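
The "four to five times" figure can be checked directly against the unsupplemented
rows of Table 16. The following is a minimal sketch of that arithmetic; the dictionary
layout and the function name fold_increase are illustrative choices, not part of the
specification.

```python
# Pantothenate titers (g/l) at 24 hours without medium supplements,
# copied from the unsupplemented rows of Table 16.
titers = {
    "PA340": 0.4, "PA404": 1.8,
    "PA342": 0.3, "PA405": 1.4,
}

# Each derived strain paired with its isogenic parent.
pairs = [("PA404", "PA340"), ("PA405", "PA342")]


def fold_increase(derived, parent, data):
    """Ratio of the derived strain's titer to its parent's titer."""
    return data[derived] / data[parent]


fold = {derived: fold_increase(derived, parent, titers) for derived, parent in pairs}

for derived, parent in pairs:
    print(f"{derived} vs {parent}: {fold[derived]:.1f}-fold")
```

Both ratios fall between four and five, in agreement with the statement above.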



EXAMPLE IX: Deregulation of the B. subtilis ilvD Gene Enhances Pantothenate
Production

To deregulate expression of the ilvD gene, standard procedures (described
above) were used to integrate the constitutive P26 promoter and an artificial ribosome
binding site, RBS2, just upstream of the ilvD coding region. The ilvD gene maps by
itself, unlinked to the ilvBNC operon. First, a 2.4 kb region of the RL-1 chromosome
that contains the ilvD coding region and 730 bp of upstream sequence was cloned by
PCR into a low copy (about 15 per E. coli cell) vector called pOK12, to give plasmid
pAN257, shown in Figure 13.

Taking advantage of a natural EcoRI site just upstream of the native ilvD gene
promoter, and a natural NcoI site at the ilvD start codon, an artificial sequence
containing P26 and RBS2 was inserted into pAN257 to give pAN263 (Figure 14). The
nucleotide sequence of pAN263 is set forth as SEQ ID NO:83. In parallel with this
construction, the cat gene was also inserted into pAN257, between the same upstream
EcoRI site and a BglII site in the middle of the ilvD coding region, to give pAN261,
which is deleted for a large portion of the ilvD gene (Figure 15).

Using pAN261 and pAN263, the P26 ilvD cassette could then be installed in the B.
subtilis chromosome in two steps. In the first step, pAN261 is introduced by
transformation, selecting for chloramphenicol resistance, and then confirming an Ilv-
phenotype. In the second step, pAN263 is introduced, selecting for Ilv+, checking for
chloramphenicol sensitivity, and confirming correct local structure by PCR.

pAN261 was first transformed into strain RL-1 (highly competent) to give strain
PA343 (ΔilvD::cat), and then chromosomal DNA from PA343 was used to transform
PA340 and PA342 to Ilv- auxotrophy, yielding strains named PA348 and PA349,
respectively. Chromosomal DNA is inherently more efficient than monomeric plasmid
in transforming B. subtilis. Similarly, pAN263 DNA was transformed into PA343
(moderately competent) to give strain PA345 (P26 ilvD), and then PA345 chromosomal
DNA was used to transform PA348 and PA349 to Ilv+ prototrophy, yielding strains
PA374 and PA354, respectively.

As predicted, PA374 and PA354 gave further increases in pantothenate
production, to about 2.5 to 2.9 g/l, in test tube cultures grown in SVY plus 5 g/l
β-alanine (Table 17).

Table 17. Pantothenate and valine production by PA374 and PA354, containing P26
ilvD, and controls, in 24 hr test tube cultures grown in SVY with 5 g/l
β-alanine and with or without 5 g/l α-KIV.

                                       OD600              [pan] g/l          [val] g/l
Strain     Background   ilvD status    -α-KIV   +α-KIV    -α-KIV   +α-KIV    -α-KIV   +α-KIV
PA340      PY79         w.t.           9.2      9.0       2.14     2.23      0.38     0.90
PA348      PY79         ilvD::cat      11.7     10.0      0.19     2.23      0.19     0.91
PA374-1    PY79         P26 ilvD       9.1      7.3       2.93     2.40      0.58     0.87
PA374-2    PY79         P26 ilvD       8.2      7.7       2.99     2.36      0.60     0.95
PA342      RL-1         w.t.           10.2     8.0       1.29     1.89      0.27     0.78
PA349      RL-1         ilvD::cat      8.1      7.7       0.17     1.87      0.22     0.88
PA354-1    RL-1         P26 ilvD       9.6      9.6       2.57     2.03      0.65     1.23
PA354-2    RL-1         P26 ilvD       7.5      8.2       2.48     2.24      0.64     0.97



In the absence of added β-alanine, strains PA374 and PA354 produced only
about 0.2 g/l pantothenate in test tube cultures, indicating that PanD activity is
significantly rate limiting.

To alleviate this limitation, the amplifiable P26 panD cassette from strain PA401
was installed. PA401 chromosomal DNA was transformed into PA374 and PA354,
selecting for Tet^r at 15 mg/l, to yield strains PA377 and PA365, respectively. After
transformants were obtained, the strains were streaked on plates containing 30 and 60
mg/l tetracycline to reamplify the copy number of the P26 panD cassette integrated at the
bpr locus. In test tube cultures grown in SVY without α-KIV or β-alanine, a substantial
improvement in pantothenate titers over those of PA374 and PA354 was obtained
(Tables 18 and 19).

Table 18. Pantothenate production by PA365, containing amplified P26 panD, and
controls, in 24 and 36 hr test tube cultures grown in SVY-glucose without
β-alanine or α-KIV.

                                      OD600               [pan] g/l
Strain       Relevant genotype        24 hrs    36 hrs    24 hrs    36 hrs
PA342-1-1    w.t. ilvD                11.7      8.8       b.d.      0.27
PA342-1-2    w.t. ilvD                12.8      8.8       b.d.      0.26
PA354-1-1    P26 ilvD                 n.d.      11.0      n.d.      0.19
PA354-1-2    P26 ilvD                 n.d.      8.4       n.d.      0.20
PA365-1      P26 ilvD, P26 panD       9.8       10.0      1.01      2.07
PA365-2      P26 ilvD, P26 panD       9.9       10.4      0.96      2.09

n.d. = not determined; b.d. = below detection



Table 19. Pantothenate production by PA377, containing amplified P26 panD, and
controls, in 27 hr test tube cultures grown in SVY-glucose or SVY-
maltose, without α-KIV, and with or without β-alanine.

OD600
                                    -β-ala     +β-ala     -β-ala     +β-ala
Strain     Relevant genotype        Glucose    Glucose    Maltose    Maltose
PA374-1    P26 ilvD                 9.4        9.8        7.0        6.4
PA374-2    P26 ilvD                 9.2        9.6        6.6        6.3
PA377-1    P26 ilvD, P26 panD       10.0       7.6        7.2        6.1
PA377-2    P26 ilvD, P26 panD       10.5       7.8        9.4        5.4

[pan] g/l
                                    -β-ala     +β-ala     -β-ala     +β-ala
Strain     Relevant genotype        Glucose    Glucose    Maltose    Maltose
PA374-1    P26 ilvD                 0.04       2.76       0.14       1.31
PA374-2    P26 ilvD                 0.10       2.65       0.15       1.33
PA377-1    P26 ilvD, P26 panD       1.25       2.76       1.26       1.10
PA377-2    P26 ilvD, P26 panD       1.25       2.35       1.31       1.26



In SVY with glucose, an increase in pantothenate production can still be
achieved by feeding 5 g/l β-alanine, suggesting that increasing panD expression further
might increase pantothenate production. In SVY with maltose, no further increase in
pantothenate was obtained by feeding β-alanine, suggesting that β-alanine and/or
aspartate synthesis is suppressed by glucose. Strains PA377 and PA365 have been
evaluated in 10 liter fermentors, where they typically produce above 20 g/l pantothenate
in 48 hours without supplemental β-alanine and α-KIV or valine, as described in detail
below.

EXAMPLE X: 10 liter Fermentations of Pantothenate-Producing Microbes 

Engineering of the ilvBNC and P26 ilvD cassettes to give strains PA342 and
PA354 allowed the production of 22 and 26 g/l of pantothenate, respectively, without
the addition of valine or α-KIV to the fermentation medium (Table 20). At 48 hours,
both strains had secreted about 0.5 g/l of valine into the medium.



Table 20. 10-liter fermentations of five pantothenate overproducing strains.

                                                   OD600    Valine   β-ala    Pantothenate (g/l)
Strain   Medium   Feed: 40% glucose plus           48 hr    48 hr    48 hr    36 hr   48 hr   72 hr
                                                            (g/l)    (g/l)
PA236    SVYG     50 g/l β-ala, 25 g/l α-KIV       108      added    added    16      19      21
PA342    SVYG     50 g/l β-ala                     92       0.5      added    17      22
PA354    SVYG     50 g/l β-ala                     90       0.5      added    19      26
PA365    SVYG     25 g/l YE                        77       0.85     0.4      18      21      27
PA377    SVYG     25 g/l YE                        85       1.5      0.5      18      22      31
PA377    PFMG     25 g/l YE                        96       0.8      0.4      19      25      29
PA377    PFMG                                      71       0.7      0.1      16      21





Pantothenate synthesis in fermentors

With the addition of the P26 panD cassette to strains PA354 and PA374 to create
strains PA365 and PA377, respectively, neither β-alanine nor α-KIV needed to be added
to the fermentors. Strain PA365 produced 21 g/l pantothenate in 48 hours and 27 g/l in 72
hours with no precursors added to the medium (Table 20). PA377 was somewhat better,
producing 18 g/l of pantothenate in 36 hours, 22 g/l in 48 hours, and 31 g/l in 72 hours.
Valine was measured at 0.85 and 1.5 g/l for strains PA365 and PA377, respectively, at
48 hours in SVYG medium. Strain PA377 maintained valine between 1 and 1.5 g/l
throughout most of the fermentation and β-alanine between 0.2 and 0.5 g/l.

Strain PA377 was further evaluated in 10-liter fermentors in yeast extract based
PFMG medium. Pantothenate yields in PFMG and SVYG medium were similar. In
PFMG, PA377 produced 19 g/l of pantothenate in 36 hours, 25 g/l in 48 hours, and 29
g/l in 72 hours. In SVYG, PA377 produced 18 g/l pantothenate in 36 hours, 22 g/l in
48 hours and 31 g/l in 72 hours (Table 20).
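
One way to compare these time courses is to divide each titer by the elapsed time,
giving an average volumetric productivity in g/l per hour. This derived quantity does
not appear in the specification; the sketch below simply applies it to the titers
reported in Table 20, and the names used are illustrative.

```python
# Pantothenate titers (g/l) at 36, 48 and 72 hours, copied from Table 20.
titers = {
    "PA365 (SVYG)": {36: 18, 48: 21, 72: 27},
    "PA377 (SVYG)": {36: 18, 48: 22, 72: 31},
    "PA377 (PFMG)": {36: 19, 48: 25, 72: 29},
}


def average_productivity(series):
    """Titer divided by elapsed hours for each time point (g/l per hour)."""
    return {hours: titer / hours for hours, titer in series.items()}


productivity = {name: average_productivity(series) for name, series in titers.items()}

for name, series in productivity.items():
    summary = ", ".join(f"{hours} h: {rate:.2f}" for hours, rate in series.items())
    print(f"{name}: {summary} g/l/h")
```

On this measure the average rate declines between 36 and 72 hours in all three runs,
even though the titers continue to rise.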

EXAMPLE XI: Converting Strain PA377 to a Tryptophan Prototroph 

PA377 (Trp-) was transformed to Trp+ using chromosomal DNA from PY79 to
give strain PA824. After re-amplification of the P26 panD cassette, PA824 was compared
to PA377 for pantothenate production in test tube cultures grown in SVY glucose with
or without 5 g/L β-alanine (Table 21).

Table 21: Trp+ derivatives of PA377: Pantothenate production in 48 hour test tube
cultures grown in SVY glucose, ± β-alanine

                         OD600                        [pan] g/L
Strain     trpC donor    - β-alanine   + β-alanine    - β-alanine   + β-alanine
PA377-1    RL-1          8             8              1.5           3.4
PA377-2    RL-1          8             9              1.6           3.6
PA824-1    PY79          12            10             0.7           3.7
PA824-2    PY79          11            11             1.9           4.9



The Trp+ strains grew to slightly higher densities than PA377. In the absence of
exogenous β-alanine, all of the strains produced similar levels of pantothenate, while
with the addition of β-alanine, the Trp+ derivatives produced somewhat more
pantothenate.

Fermentor studies with PA824

PA824 was evaluated in CF3000 Chemap 14 liter vessels with 10 liter working
volumes. Formulations for two of the media used in the fermentors are given in Tables
22 and 23.

Table 22: Formulation for PFMG-5 medium

BATCH
     MATERIAL            g/L (final concentration)
1    Amberex 1003        10
2    Na Glutamate        5
3    (NH4)2SO4           8
4    MAZU DF 37C         2.5

Added After Sterilization and Cool Down
1    KH2PO4              10
2    K2HPO4·3H2O         20

1    Glucose             20
2    MgCl2·6H2O          1
3    CaCl2·2H2O          0.1

1    Sodium Citrate      1
2    FeSO4·7H2O          0.01
3    SM-1000X            1.0 ml

     H2O                 qs to 6000 ml

FEED
     MATERIAL            g/L
1    Glucose             600
2    CaCl2·2H2O          0.6

     H2O                 qs to 3000 ml

Table 23: Formulation for SVY-4 medium

BATCH
     MATERIAL            g/L (final concentration)
1    Veal Infusion       25
2    Yeast Extract       5
3    Na Glutamate        5
4    (NH4)2SO4           4
5    MAZU DF 37C         2.5

Added After Sterilization and Cool Down
1    KH2PO4              10
2    K2HPO4·3H2O         20

1    Glucose             20
2    MgCl2·6H2O          1
3    CaCl2·2H2O          0.1

1    Sodium Citrate      1
2    FeSO4·7H2O          0.01
3    SM-1000X            1.0 ml

     H2O                 qs to 6000 ml

FEED
     MATERIAL            g/L
1    Glucose             600
2    CaCl2·2H2O          0.6

     H2O                 qs to 3000 ml



All fermentations were glucose-limited fed batch processes. Immediately after
inoculation, agitation was set at 200 rpm. The initial batched 2% glucose was consumed
during exponential growth. Afterwards, glucose concentrations were maintained
between 0.2 and 1.0 g/L by continuous feeding of a 60% glucose solution. The variable
rate feed pump was computer controlled and linked to the dissolved oxygen
concentration [pO2] in the tank by an algorithm. When the [pO2] fell to 30%, computer
control began to automatically adjust the agitation rate to maintain a dissolved oxygen
concentration between 25 and 30% [pO2]. Computer control and data recording were by
Braun MFCS software.
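
The specification does not disclose the control algorithm itself. As a purely
illustrative sketch, and not a description of the Braun MFCS software, holding
dissolved oxygen within a 25 to 30% band by adjusting agitation could be expressed as
a simple dead-band rule. The step size, the upper agitation limit, and the use of the
initial 200 rpm as a lower limit are all assumptions made for this example.

```python
def next_agitation(rpm, po2, low=25.0, high=30.0,
                   step=10, rpm_min=200, rpm_max=1000):
    """Return the agitation setpoint (rpm) for the next control interval.

    low/high are the dissolved oxygen band (% pO2) stated in the text.
    step, rpm_min and rpm_max are hypothetical values chosen for illustration.
    """
    if po2 < low:
        rpm += step   # oxygen below the band: stir faster
    elif po2 > high:
        rpm -= step   # oxygen above the band: stir slower
    # Inside the band the setpoint is left unchanged.
    return max(rpm_min, min(rpm_max, rpm))


# Example: a falling pO2 trace drives the agitation rate upward.
rpm = 200
for po2 in [45.0, 32.0, 29.0, 24.0, 22.0, 26.0]:
    rpm = next_agitation(rpm, po2)
    print(f"pO2 {po2:4.1f}% -> agitation {rpm} rpm")
```

A production controller would more likely use proportional or PID action; the
dead-band form is shown only because it maps directly onto the stated 25 to 30% range.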

In one study, PA824 was grown in fermentors at two temperatures (40°C and
43°C) in the medium described in Table 22. Results of two experiments demonstrated
that the highest pantothenate titers at early time points were produced at 43°C. At
43°C, the cell mass approached 150 optical density units (OD600) at 56 hours, and the
pantothenate titers were 21 g/L, 28 g/L and 36 g/L at 36, 48 and 72 hours, respectively.
In the parallel fermentation at 40°C, the cell mass approached 120 optical density units
(OD600) at 56 hours, and the pantothenate titers were 18 g/L, 26 g/L and 37 g/L at 36,
48 and 72 hours, respectively.

In another study, PA824 was grown in a fermentor at 43°C in the medium
described in Table 23. The cell mass exceeded 160 optical density units (OD600) at 36
hours, and the pantothenate titers were 23 g/L, 34 g/L, 37 g/L and 40 g/L at 24, 36, 48
and 60 hours, respectively. In other fermentations, increasing the amount of trace
elements in the glucose feed (e.g., increasing the concentration of SM from 1X to 2X)
resulted in even higher titers of pantothenate.

EXAMPLE XII: Identification and characterization of the B. subtilis coaA gene 
product 

The annotated version of the B. subtilis genome sequence available on the
"SubtiList" web site contains no gene labeled as coaA. However, a homology search
using the protein sequence of E. coli pantothenate kinase as a query sequence gave a
good match with B. subtilis gene yqjS, which is annotated as "unknown; similar to
pantothenate kinase." This gene appears to be the penultimate gene in an operon
containing five open reading frames (Figure 18). Two of the open reading frames
encode proteins which are similar to D-serine dehydratase and to "ketoacyl reductase";
the other two have no known homologies. For the open reading frame corresponding to
coaA, there are three possible start codons, each having a possible ribosome-binding site
(RBS) associated with it. The three potential coaA ORFs were named coaA1, coaA2,
and coaA3, from longest to shortest.

All three potential coaA open reading frames were cloned along with their
respective RBSs by PCR followed by ligation into expression plasmid pAN229.
pAN229 is a low copy vector in E. coli that provides expression from the SP01 phage
P15 promoter and can integrate by single crossover at bpr with tetracycline selection. A
representative resulting plasmid, pAN281, is shown in Figure 19.

To determine if the cloned putative coaA ORFs actually encode a pantothenate
kinase activity, several isolates of all three plasmids were transformed into the E. coli
strain YH1, which contains the coaA15(Ts) allele. Transformants were streaked to plates
incubated at 30°C and 43°C to test for complementation of the temperature sensitive
allele. All isolates of all three coaA variants, except for one isolate of pAN282,
complemented well at 43°C, indicating that all three plasmid constructs encode an active
pantothenate kinase. Accordingly, it can be concluded that the B. subtilis yqjS open
reading frame codes for an active pantothenate kinase.

EXAMPLE XIII: Deletion of the coaA gene from the B. subtilis genome

The coaA gene of B. subtilis (yqjS) was deleted from the chromosome of a B.
subtilis strain by conventional means. The majority of the coaA coding sequence was
deleted from a plasmid clone and replaced by a chloramphenicol resistance gene (cat),
while leaving approximately 1 kb of upstream and downstream sequence to allow
homologous recombination within the chromosome, to give plasmid pAN296 (see
Figure 17). pAN296 was then used to transform a B. subtilis strain (PY79), selecting for
chloramphenicol resistance. The majority of transformants result from a double
crossover event that effectively substitutes the cat gene for the coaA gene. The
transformed strain containing the coaA deletion - cat insertion grew normally due to the
presence of a second B. subtilis pantothenate kinase encoding gene described herein.

EXAMPLE XIV: Identification and characterization of a second B. subtilis gene 
encoding pantothenate kinase activity 

As described in detail in the instant specification, in order to maximize
pantothenate production, it is necessary to restrict the flow of pantothenate toward
Coenzyme A (CoA), for example, by reducing the activity of pantothenate kinase, the
first enzyme in the pathway from pantothenate to CoA. After finding that deletion of
the coaA gene from the chromosome of B. subtilis is not a lethal event (see Example
XIII), it was concluded that B. subtilis must contain a second gene that encodes an active
pantothenate kinase, since pantothenate kinase is an essential enzyme activity.

A second pantothenate kinase-encoding gene was identified by complementing
the E. coli strain YH1 (coaA15(Ts)) with a B. subtilis gene bank and selecting for
transformants that were able to grow at 43°C. Found among the transformants were two
families of plasmids that had overlapping restriction maps within each family, but not
between the families. As expected, the restriction map of one family was identical to
that predicted from the B. subtilis genome sequence for the homologue of the E. coli
coaA gene (which we named coaA also, see above) and surrounding sequences. The
other family had a restriction map that was completely non-overlapping with the first.

DNA sequencing of the ends of the cloned inserts from the second family
showed that the clones came from a region of the B. subtilis chromosome that includes
the 3' end of the ftsH gene, the 5' end of the sul gene, and all of the yacB, yacC, yacD,
cysK, pabB, pabA and pabC genes. None of the open reading frames of these cloned
inserts showed homology to any known pantothenate kinase sequences, either
prokaryotic or eukaryotic.

Several deletions were created through the B. subtilis genomic sequences in the
cloned inserts. Each deletion was tested for complementation of the E. coli temperature
sensitive pantothenate kinase. In particular, a deletion that removed all DNA between a
StuI site in the cloning vector and a SwaI site in the yacC gene leaves yacB as the only
intact open reading frame in the cloned insert (see Figure 21). This deleted plasmid still
complemented the E. coli pantothenate kinase mutant. However, another deletion that
removed DNA from the SwaI site in yacC through a Bst1107I site in the (already
truncated) ftsH gene could not complement the E. coli pantothenate kinase mutant.
From these results, it was concluded that the yacB open reading frame was responsible
for the complementation activity. To confirm that yacB is a pantothenate kinase gene,
the yacB ORF plus 112 base pairs of downstream flanking sequence was amplified by
PCR in two independent reactions and cloned downstream of a constitutive promoter to
give plasmids pAN341 and pAN342 (Figure 22). Both pAN341 and pAN342
complemented the defect in YH1 at 44°C, while a control plasmid, which has the same
backbone but expresses panBCD instead of yacB, did not. This confirmed that the yacB
open reading frame was responsible for the complementation of YH1.

As such, a novel gene that encodes pantothenate kinase activity in B. subtilis has
been discovered that is not related by homology to any previously known pantothenate
kinase gene. This gene has been renamed coaX, as a second, alternative gene that
encodes an enzyme that catalyzes the first step in the pathway from pantothenate to
CoA. Deletion of coaX by methods described above for deleting coaA, in conjunction
with reduction in the activity of the CoaA enzyme, provides a means to reduce
pantothenate kinase activity to the desired level.

Several homologues of the B. subtilis coaX gene were identified by homology
searching of various publicly available databases using the published yacB (coaX)
open reading frame sequence and predicted amino acid sequence (as set forth in SEQ ID
NOs:84 and 85, respectively). In two cases (Mycobacterium tuberculosis and
Streptomyces coelicolor) the homologous coaX genes are adjacent to, or almost adjacent
to, pantothenate biosynthetic genes, consistent with these homologs having a role in
pantothenate metabolism. The CoaX proteins show no homology to the CoaA family of
pantothenate kinases, nor to the eukaryotic family of pantothenate kinases exemplified
by PanK of Saccharomyces cerevisiae.

Alignment of the amino acid sequences of several bacterial CoaX homologs with
the amino acid sequence predicted from translating the B. subtilis yacB ORF described
in the published B. subtilis genome sequence revealed that the CoaX proteins from other
bacteria contained additional amino acid residues at their carboxy-terminal ends.
Moreover, these extensions beyond the end of the predicted amino acid sequence for the
B. subtilis gene product contained two relatively well conserved segments of sequence.

Translation of nucleotide sequences just downstream from the stop codon of the
B. subtilis yacB ORF in a different reading frame revealed the existence of amino acid
sequences very similar to the carboxy-terminal extensions of the other bacterial CoaX
proteins. It is thus believed that an error exists in the published DNA sequence of the B.
subtilis yacB ORF sequence that causes a frame shift leading to an artifactual
downstream amino acid sequence and premature termination.
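
The reasoning above can be illustrated with a toy sequence. The sketch below uses a
short invented ORF, not the yacB sequence, to show how a single base missing from a
recorded sequence shifts the reading frame, introduces a premature stop codon, and
leaves the true carboxy-terminal residues recoverable in a different reading frame
downstream of that stop. The sequences and function names are hypothetical; only the
standard genetic code is taken as given.

```python
from itertools import product

# Standard genetic code, built in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate from the first base until a stop codon or the end of the DNA."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Invented "true" ORF and a "recorded" copy in which one base (index 6) is missing.
true_orf = "ATGAAAGGCCTGACCGAAGTTTAA"
recorded_orf = true_orf[:6] + true_orf[7:]

true_protein = translate(true_orf)
recorded_protein = translate(recorded_orf)
print("true protein:    ", true_protein)
print("recorded protein:", recorded_protein)

# The premature stop in the recorded sequence occupies bases 9-11.
# Translate the region after it in each of the three reading frames.
stop_end = 12
downstream = [translate(recorded_orf[stop_end + shift:]) for shift in range(3)]
for shift, peptide in enumerate(downstream):
    print(f"downstream frame +{shift}: {peptide}")
```

In this toy case one of the three downstream frames reproduces the end of the true
protein, which is the same pattern that pointed to a frame-shift error in the
published yacB sequence.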

The PCR-generated sequences of B. subtilis CoaX in pAN341 and pAN342
(described above) contain enough downstream flanking sequence to encode the putative
carboxy-terminal extension described above, which is consistent with the result that the
clones were functional in the complementation assay. However, when the 3' PCR
primer was positioned to include only the shorter yacB ORF predicted from the
published sequence, but not to include the putative carboxy-terminal extension, then the
resulting plasmids, pAN329 and pAN330 (similar in structure to pAN341 and pAN342;
see Figure 22), did not complement the defect in YH1. This result supports the notion
that the published yacB coding sequence contains a frame-shift error, and that the
carboxy-terminal end of CoaX is necessary for pantothenate kinase activity. The
predicted correct nucleotide sequence for B. subtilis coaX is set forth as SEQ ID NO:19
and the translated amino acid sequence is set forth as SEQ ID NO:9. A multiple
sequence alignment of the CoaX amino acid sequences of B. subtilis and 11 homologues
thereof is set forth in Figure 23.

EXAMPLE XV: Generation of mutant coaA genes encoding pantothenate kinase
having reduced or temperature sensitive activities

This Example describes strategies for modifying the coaA gene (i.e., by
introducing point mutations) to reduce the activity of pantothenate kinase after coaX is
deleted from the genome.

Cloning and sequencing of the temperature sensitive allele of the E. coli coaA gene.

Two E. coli strains, each exhibiting a different mutant CoaA phenotype, were
obtained from the E. coli Genetic Stock Center. Strain DV62 contains the coaA15(Ts)
allele, and DV79 contains the coaA16(Fr) mutation. DV62 is temperature sensitive at
43°C and produces a pantothenate kinase that is temperature sensitive. DV79 was
obtained by reversion of DV62 to temperature resistance, and it produces a temperature
stable, feedback resistant pantothenate kinase activity. Since the DNA sequences of
these alleles are not available in the literature, the coaA genes from the two mutant
strains were cloned by PCR and sequenced, in addition to a coaA gene from a strain that
is wild type at the coaA locus, MM294. The PCR primer at the 5' end was designed to
include the start codon plus four bases upstream, and added an arbitrarily chosen
ribosome binding site (RBS). The three PCR generated fragments were each ligated
between the XbaI and BamHI sites of pAN229 to give pAN284 (from coaA15(Ts)),
pAN285 (from wild type coaA), and pAN286 (from coaA16(Fr)). pAN229 is a low
copy E. coli vector that provides expression from the P15 promoter and that can integrate
by single crossover at bpr in B. subtilis with tetracycline selection.

All three plasmids were transformed into the E. coli strain YH1 for
complementation testing. All three plasmids complemented the temperature sensitive
coaA mutation in E. coli YH1. It is presumed that the coaA15(Ts) gene in pAN284 is
significantly overexpressed relative to the normal chromosomal gene, such that
the overproduction compensates for the temperature sensitive defect. Complementation
of a defect by overproduction is a well-documented phenomenon in E. coli.

The coaA coding regions from pAN284, 285, and 286 were subcloned into
pGEM7 to give pAN306, 307, and 308, respectively, for DNA sequencing. As
expected, the DNA sequence of the insert in pAN307 (from wild type coaA) matched
the coaA sequence from the E. coli genome database (GenBank™). The sequence from
pAN306 contains a single base change that causes a S176L substitution (i.e., a Ser ->
Leu substitution in the amino acid sequence set forth as SEQ ID NO:2). Interestingly,
the DNA sequence of the pAN308 insert, derived from the feedback resistant strain, was
identical to that derived from its temperature sensitive parent (represented in pAN306).
This is in accord with the genetic data that indicate that the reversion of the temperature
sensitive mutation occurred at a second site unlinked to the coaA gene.
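
The observation that a single base change can convert a serine codon into a leucine codon can be illustrated with a short sketch. The code below is not taken from the sequencing data; it only enumerates, under the standard genetic code, which serine codons can become leucine codons by one substitution. The function name single_base_routes is a hypothetical helper introduced for illustration, and the actual codon altered in coaA15(Ts) is not stated here.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def single_base_routes(from_aa, to_aa):
    """Return sorted (original codon, mutant codon) pairs differing at one base."""
    routes = []
    for codon, aa in CODON_TABLE.items():
        if aa != from_aa:
            continue
        for pos in range(3):
            for base in BASES:
                if base == codon[pos]:
                    continue
                mutant = codon[:pos] + base + codon[pos + 1:]
                if CODON_TABLE[mutant] == to_aa:
                    routes.append((codon, mutant))
    return sorted(routes)


# Serine (S) to leucine (L) by one substitution.
for original, mutant in single_base_routes("S", "L"):
    print(original, "->", mutant)
```

Under these assumptions only two of the six serine codons can reach leucine in one step, both through a C to T change at the second codon position. The same helper can be applied to any other pair of amino acids.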

The S176L mutation, predicted to cause the temperature sensitive defect in E.
coli pantothenate kinase, changed a serine residue that is conserved in all known or
suspected bacterial coaA encoded pantothenate kinases, including that of B. subtilis (see
SEQ ID NO:3 and refer to alignment). Based on this, a serine to leucine change at the
homologous residue in the B. subtilis pantothenate kinase is predicted to result in either
a temperature sensitive enzyme or one which is less active. Accordingly, to produce a
mutant B. subtilis coaA gene, this specific change was introduced into the B. subtilis
coaA gene. The mutant version is installed in the chromosome of a B. subtilis strain
deleted for coaX, for example, and the recombinant microorganism is checked for
temperature sensitivity (e.g., reduced growth at 43 °C). The mutation is then installed
into a pantothenate overproducing strain, preferably a strain deleted for the above
mentioned coaX gene, by standard methods to give strains favorable for pantothenate
production in B. subtilis, i.e., strains that have reduced pantothenate kinase activity under
typical fermentation conditions.


Additional coaA point mutations resulting in reduced pantothenate kinase activity 

It is expected that many other point mutations, or combinations of more
than one point mutation, in B. subtilis coaA will also lead to reduced activity.
Appropriate mutations can be generated by mutagenic polymerase chain reaction and in
vitro recombination, and identified by screening for alleles that poorly complement the
E. coli coaA15(Ts) mutant. An example of a mutation of this type is a tyrosine to
histidine substitution at amino acid 181 of B. subtilis coaA, generated by mutagenic
polymerase chain reaction (see SEQ ID NO:3 and first line of the alignment of Figure
24).

Isolate pAN282A was derived from the middle-sized B. subtilis coaA open
reading frame described in Example XII. pAN282A complemented the E. coli
coaA15(Ts) mutant very poorly, but nonetheless at a level that was detectable above
background. As was done for the E. coli coaA clones, the open reading frame from
pAN282A was subcloned into pGEM7 to give pAN303. The DNA sequence of the
insert in pAN303 showed a single base change that led to a tyrosine to histidine amino
acid change at the tyrosine corresponding to Y181 of SEQ ID NO:3. This tyrosine
residue is conserved in all bacterial coaA genes/homologues present in GenBank (Figure
24). This tyrosine residue and the serine that is altered in the E. coli temperature
sensitive pantothenate kinase described above are separated by only three amino acid
residues in a region which is highly conserved in bacterial pantothenate kinases. In contrast,
the DNA sequence of a second isolate of the middle-sized open reading frame, from
pAN282B, was identical to the wild type sequence from the B. subtilis genome
sequencing project. The single base change found in pAN303 probably occurred during
PCR amplification of the coaA gene. If this variant of coaA2 has sufficient residual
biological activity in B. subtilis, it may be useful in the future for providing reduced
pantothenate kinase activity.

A preferred plasmid that can serve as a basis for mutagenizing the coaA open
reading frame is pAN294 (see e.g., Figure 25 and Example XII). Briefly, mutagenic
PCR is performed using pAN294 as a template and variants of coaA having reduced
pantothenate kinase activity are screened as described above. Alternatively, mutations
such as the one isolated in pAN282A can be installed into pAN294. The desired
mutation is then introduced into the chromosome of a B. subtilis strain by transformation
with the appropriate pAN294 derivative and selected for chloramphenicol resistance at 5
mg/L. Among the resulting transformants will be isolates that contain the desired
mutation.

In a similar fashion, mutations that reduce the activity of the CoaX enzyme can 
be generated and identified, and such mutations used for optimizing pantothenate 
production by reducing CoA production as described above. 

EXAMPLE XVI: Deleting the second pantothenate kinase gene, coaX, from
B. subtilis

With the knowledge gained above concerning the existence and nature of coaX,
one can create a deletion of the coaX open reading frame from the B. subtilis
chromosome that will remove the encoded activity, and that will not adversely affect the
expression of the genes downstream from coaX. In such a deleted strain, the coaA gene
will be the only gene that encodes pantothenate kinase.

To delete the coaX gene from B. subtilis, plasmid pAN336 (SEQ ID NO:92),
which contains upstream and downstream homology for double crossover, was
constructed with a kanamycin resistance gene replacing most of the coaX ORF (Figure
26). Strain PY79 was transformed to kanamycin resistance by pAN336, and an isolate
confirmed by PCR to have resulted from a double crossover was named PA876. As
predicted, deletion of coaX by itself is not lethal for B. subtilis. Furthermore,
chromosomal DNA from PA876 would not transform competent PA861 (PY79 ΔcoaA::cat)
to kanamycin resistance. These results indicate that it is the combination of
ΔcoaA::cat and ΔcoaX::kan that is lethal for B. subtilis, confirming that B. subtilis
contains two unlinked genes that encode pantothenate kinase, coaA and coaX, and that
either gene alone is capable of supplying sufficient pantothenate kinase for a normal rate
of growth.
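
The genetic reasoning in this example can be summarized as a small Boolean sketch. It is a deliberately simplified model, assuming only that a strain is viable whenever at least one pantothenate kinase gene remains intact; the function name is_viable and the genotype labels are illustrative and are not part of the experimental record.

```python
# Genes taken from the text as encoding pantothenate kinase in B. subtilis.
KINASE_GENES = {"coaA", "coaX"}


def is_viable(deleted_genes):
    """Model assumption: viable if at least one pantothenate kinase gene remains."""
    return bool(KINASE_GENES - set(deleted_genes))


genotypes = {
    "PY79 (no deletion)": set(),
    "PA861 (coaA deleted)": {"coaA"},
    "PA876 (coaX deleted)": {"coaX"},
    "coaA and coaX both deleted": {"coaA", "coaX"},
}

for name, deleted in genotypes.items():
    status = "viable" if is_viable(deleted) else "not viable"
    print(f"{name}: {status}")
```

In this model each single deletion is tolerated and only the double deletion is not, which mirrors the failure to transform PA861 to kanamycin resistance with PA876 chromosomal DNA.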

EXAMPLE XVII: Construction of a plasmid designed to allow directed
mutagenesis of the B. subtilis coaA gene

In order to easily introduce mutated coaA genes into the B. subtilis chromosome,
it was necessary to install an antibiotic resistance gene adjacent to the coaA gene. This
was accomplished by joining together in the vector pGEM5 three DNA fragments: (1) a
3.4 kb DNA sequence containing 2.5 kb of genomic sequence upstream from coaA and
the coaA open reading frame(s); (2) a 1.1 kb DNA sequence containing a
chloramphenicol resistance gene (cat); and (3) a 1.4 kb DNA sequence comprising a
region downstream from the operon that contains coaA. The resulting plasmid, named
pAN294, effectively replaces the open reading frame yqjT (the open reading frame just
downstream from coaA) with the cat gene, with enough homology flanking both sides of
the cat gene to allow double recombination into the B. subtilis chromosome (Figure 25).
pAN294 was transformed into B. subtilis strain PY79, selecting for chloramphenicol
resistance at 5 mg/L to give strains PA836 and PA837, which are presumably identical.
PA836 and PA837 were checked by diagnostic PCR to show that the cat gene had
integrated by double crossover, as opposed to single crossover. PA836 and PA837 grow
normally, leading to the conclusion that the open reading frame yqjT is not essential
(i.e., the yqjT open reading frame could be deleted from strains PA836 and PA837 with
no significant effect on growth or pantothenate production). Thus, variant alleles (i.e.,
mutations) of the coaA gene can be introduced into pAN294 and the resulting plasmids
can be used to introduce the variant alleles into the chromosome of, for example, a B.
subtilis strain.

EXAMPLE XVIII: Generation of mutant coaX genes encoding pantothenate
kinase having reduced or temperature sensitive activities

Mutant coaX genes are generated by introducing point mutations into the gene
and testing the resulting mutants for the ability to complement the E. coli YH1 strain as
described in Example XII. Preferred mutations in the coaX gene sequences are those
that encode a substitution of a residue conserved among CoaX sequences from a variety
of bacterial sources (e.g., a conserved residue set forth in Figure 23). Alternatively,
random mutations in the coaX gene sequence are generated by mutagenic PCR and in
vitro recombination and identified by screening for alleles that poorly complement the
E. coli coaA15(Ts) mutant.

Mutants so generated (i.e., mutants having reduced coaX activity) can be further 
engineered such that the endogenous coaA gene is deleted (as described in Example 
XIII). CoaX reduced-activity mutants can also be further engineered to contain reduced- 
activity CoaA gene products as described in Example XV. 


EXAMPLE XIX: Enhanced Production of Panto-Compounds Using Bacteria 
Having Deletions in One or More Pantothenate Biosynthetic Enzymes 

If the desired panto-compound is not pantothenate, then an appropriate deletion
of one or more of the pantothenate biosynthetic genes from a pantothenate
overproducing strain will provide a strain that produces said desired panto-compound.
In this example, the desired panto-compound is pantoate. Starting with, for example,
strain PA236, PA313 or PA824, either one or both of the panC and panD genes is
deleted. In another example, ketopantoate is the desired panto-compound. Starting
with, for example, strain PA244, PA245 or PA824, one, two or all of the ilvC, panE1,
panC and panD genes are deleted from the starting strain. If β-alanine is the desired
panto-compound, then panB and panC can be deleted, preferably in a fashion that leaves
an in frame fusion of a small portion of the 5' end of panB with a small portion of the
3' end of panC, from the strain PA221, PA235, PA245, or PA313. In all of the
above-mentioned examples, the panto-compound producing strain will be a pantothenate
auxotroph. Accordingly, the growth medium requires sufficient pantothenate for
adequate growth. Vectors designed to overexpress panD as described above are then
transformed into the above strains to further enhance β-alanine production.
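
The relationship between the deleted gene(s) and the panto-compound that accumulates can be sketched as a small pathway model. This is only an illustrative simplification: it assumes a linear pathway in which panB, panE, panC and panD encode ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase and aspartate-α-decarboxylase, respectively, it treats ketopantoate reductase as a single panE step, and it ignores any contribution of ilvC. The names PATHWAY, PRECURSORS and accumulated_compounds are hypothetical.

```python
# Each step: (gene, set of required substrates, product).
PATHWAY = [
    ("panB", {"alpha-ketoisovalerate"}, "ketopantoate"),
    ("panE", {"ketopantoate"}, "pantoate"),
    ("panD", {"aspartate"}, "beta-alanine"),
    ("panC", {"pantoate", "beta-alanine"}, "pantothenate"),
]
# Precursors assumed to be supplied by central metabolism.
PRECURSORS = {"alpha-ketoisovalerate", "aspartate"}


def accumulated_compounds(deleted_genes):
    """Return panto-compounds that are made but not consumed in the model."""
    active = [step for step in PATHWAY if step[0] not in deleted_genes]

    # Propagate availability through the remaining reactions until stable.
    available = set(PRECURSORS)
    changed = True
    while changed:
        changed = False
        for _, substrates, product in active:
            if substrates <= available and product not in available:
                available.add(product)
                changed = True

    # Anything used by a reaction that can still run does not accumulate.
    consumed = set()
    for _, substrates, _ in active:
        if substrates <= available:
            consumed |= substrates

    return available - consumed - PRECURSORS


print(accumulated_compounds({"panC", "panD"}))
print(accumulated_compounds({"panE", "panC", "panD"}))
print(accumulated_compounds({"panB", "panC"}))
```

In this sketch, deleting panC and panD leaves pantoate as the end product, additionally deleting panE leaves ketopantoate, and deleting panB and panC leaves β-alanine, in line with the deletions described above. None of the deleted strains in the model produces pantothenate, consistent with their being pantothenate auxotrophs.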

The above-mentioned deletions are accomplished by methods well-known to
those skilled in the art, for example, by insertion of an antibiotic resistance gene and
removing sufficient sequence from the target gene(s) to inactivate said target gene(s).
Alternatively, the targeted sequences are removed without simultaneous
introduction of an antibiotic resistance gene into said target gene, and the deletion is then
introduced by congression (co-transformation with any other appropriate selectable DNA
sequence), followed by screening for the loss of function of said target gene by replica plating.



Table 24: Strains (and corresponding phenotypes) for panto-compound production

Name    Phenotype  Drug resist.  panBCD locus  panE locus
PA221   Trp-                     P26panBCD
PA222                            P15panBCD
PA235                            P26panBCD
PA236                            P15panBCD     P26panE1
PA327   Trp-                     P26panBCD     P26panE1
PA328   Trp-                     P26panBCD     P26panE1
PA340   Trp-       Spc           P26panBCD     P26panE1
PA342   Trp-       Spc           P26panBCD     P26panE1
PA354   Trp-       Spc           P26panBCD     P26panE1
PA365   Trp-       Spc, Tet      P26panBCD     P26panE1
PA374   Trp-       Spc           P26panBCD     P26panE1
PA377   Trp-       Spc, Tet      P26panBCD     P26panE1
PA401   Trp-                     P26panBCD
PA402   Trp-                     P26panBCD
PA403   Trp-                     P26panBCD
PA404   Trp-       Spc, Tet      P26panBCD     P26panE1
PA405   Trp-       Spc, Tet      P26panBCD     P26panE1
PA651   Trp-       Spc           P26panBC*D    P26panE1
PA284              Spc, Tet      P26panBCD     P26panE1

Table 24 (continued): ilvD locus, amyE locus, bpr locus and Parent columns, entries in source order

amyE locus / bpr locus:
P26ilvBNC
P26ilvBNC
P26panD423
P26panD423
P26panD428
P26panD429
P26ilvBNC  P26panD423

Parent:
RL-1
PA221
PA221
PA235
PA327
PA328
PA342
PA354
PA340
PA374
PA221
PA221
PA221
PA340

amyE locus  bpr locus   Parent
P26ilvBNC   P26panD423  PA342
P26ilvBNC               PA374
P26ilvBNC   P26panD423  PA377



Equivalents

Those skilled in the art will recognize, or be able to ascertain using no
more than routine experimentation, many equivalents to the specific embodiments of the
invention described herein. Such equivalents are intended to be encompassed by the
following claims.

What is claimed:

1. A method of producing a panto-compound comprising culturing a
microorganism which overexpresses at least one Bacillus pantothenate biosynthetic
enzyme under conditions such that the panto-compound is produced.

2. The method of claim 1, wherein the microorganism overexpresses at least
one Bacillus subtilis pantothenate biosynthetic enzyme.

3. The method of claim 1 or 2, wherein the pantothenate biosynthetic
enzyme is selected from the group consisting of ketopantoate hydroxymethyltransferase,
pantothenate synthetase, aspartate-α-decarboxylase and ketopantoate reductase.

4. The method of any one of claims 1 to 3, wherein the microorganism
overexpresses at least two pantothenate biosynthetic enzymes.

5. The method of any one of claims 1 to 3, wherein the microorganism
overexpresses at least three pantothenate biosynthetic enzymes.

6. The method of any one of claims 1 to 5, wherein the panto-compound is
selected from the group consisting of pantothenate, pantoate, ketopantoate and
β-alanine.

7. A method of producing a panto-compound comprising culturing a
ketopantoate reductase-overexpressing (KPAR-O) microorganism under conditions such
that the panto-compound is produced.

8. The method of claim 7, wherein the panto-compound is pantothenate or
pantoate.

9. The method of claim 7 or 8, wherein the ketopantoate reductase is
bacterial-derived.

10. The method of claim 7 or 8, wherein the ketopantoate reductase is
derived from Bacillus.

11. The method of claim 7 or 8, wherein the ketopantoate reductase is
derived from Bacillus subtilis.

12. The method of any one of claims 7 to 11, wherein the KPAR-O
microorganism further overexpresses at least one pantothenate biosynthetic enzyme in
addition to overexpressing ketopantoate reductase.

13. The method of claim 12, wherein the KPAR-O microorganism further
overexpresses at least one of ketopantoate hydroxymethyltransferase, pantothenate
synthetase and aspartate-α-decarboxylase.

14. A method of producing pantothenate in a manner independent of
precursor feed comprising culturing an aspartate-α-decarboxylase-overexpressing
(AαD-O) microorganism having a deregulated isoleucine-valine (ilv) pathway under conditions
such that pantothenate is produced.

15. A method of producing at least 2 g/L pantothenate in a manner
independent of aspartate or β-alanine feed comprising culturing an
aspartate-α-decarboxylase-overexpressing (AαD-O) microorganism under conditions such that
pantothenate is produced.

16. A method of producing at least 2 g/L pantothenate in a manner
independent of valine or α-ketoisovalerate feed comprising culturing a microorganism
having a deregulated isoleucine-valine (ilv) biosynthetic pathway under conditions such
that pantothenate is produced.

17. A method of producing at least 30 g/L pantothenate in a manner
independent of aspartate or β-alanine feed comprising culturing an
aspartate-α-decarboxylase-overexpressing (AαD-O) microorganism under conditions such that
pantothenate is produced.

18. A method of producing at least 30 g/L pantothenate in a manner
independent of valine or α-ketoisovalerate feed comprising culturing a microorganism
having a deregulated isoleucine-valine (ilv) biosynthetic pathway under conditions such
that pantothenate is produced.

19. A β-alanine independent high yield production method for producing
pantothenate comprising culturing a manipulated microorganism under conditions such
that pantothenate is produced at a significantly high yield.

20. The method of any one of claims 14 to 19, wherein the microorganism
overexpresses acetohydroxyacid synthetase or is transformed with a vector comprising
an ilvBN nucleic acid sequence or an alsS sequence.

21. The method of any one of claims 14 to 19, wherein the microorganism
overexpresses acetohydroxyacid isomeroreductase or is transformed with a vector
comprising an ilvC nucleic acid sequence.

22. The method of any one of claims 14 to 19, wherein the microorganism
overexpresses dihydroxyacid dehydratase or is transformed with a vector comprising an
ilvD nucleic acid sequence.

23. The method of any one of claims 19 to 22, wherein the microorganism
overexpresses aspartate-α-decarboxylase or is transformed with a vector comprising a
panD nucleic acid sequence.

24. The method of any one of claims 14 to 23, wherein the microorganism
further has a deregulated pantothenate biosynthetic pathway.

25. The method of any one of claims 14 to 24, wherein the microorganism
further has at least one mutant gene selected from the group consisting of a mutant avtA
gene, a mutant ilvE gene, a mutant ansB gene and a mutant alsD gene.

26. The method of claim 24, wherein the microorganism overexpresses any
of ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate
synthetase and aspartate-α-decarboxylase.

27. The method of claim 24 or 26, wherein the microorganism is transformed
with a vector comprising a panBCD nucleic acid sequence or a vector comprising a
panE1 nucleic acid sequence.

28. The method of any one of claims 14 to 16 and 19 to 27, wherein
pantothenate is produced at a level selected from the group consisting of a level greater
than 10 g/L, a level greater than 20 g/L and a level greater than 40 g/L.

29. The method of claim 20, wherein the microorganism overexpresses
acetohydroxyacid synthetase derived from Bacillus or is transformed with a vector
comprising an ilvBN nucleic acid sequence or an alsS nucleic acid sequence derived
from Bacillus.

30. The method of claim 21, wherein the microorganism overexpresses
acetohydroxyacid isomeroreductase derived from Bacillus or is transformed with a
vector comprising an ilvC nucleic acid sequence derived from Bacillus.

31. The method of claim 22, wherein the microorganism overexpresses
dihydroxyacid dehydratase derived from Bacillus or is transformed with a vector
comprising an ilvD nucleic acid sequence derived from Bacillus.

32. The method of claim 23, wherein the microorganism overexpresses
aspartate-α-decarboxylase derived from Bacillus or is transformed with a vector
comprising a panD nucleic acid sequence derived from Bacillus.

33. The method of claim 24 or 26, wherein the microorganism overexpresses
any of ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate
synthetase and aspartate-α-decarboxylase derived from Bacillus.

34. The method of claim 27, wherein the vector comprises a panBCD nucleic
acid sequence or a panE1 nucleic acid sequence derived from Bacillus.

35. A method of producing a panto-compound comprising contacting a
composition comprising at least one pantothenate biosynthesis pathway precursor or
isoleucine-valine biosynthesis pathway precursor with at least one isolated Bacillus
enzyme selected from the group consisting of ketopantoate hydroxymethyltransferase,
ketopantoate reductase, pantothenate synthetase and aspartate-α-decarboxylase, under
conditions such that the panto-compound is produced.

36. A method of producing β-alanine comprising culturing an
aspartate-α-decarboxylase-overexpressing (AαD-O) microorganism under conditions such that
β-alanine is produced.

37. The method of claim 36, wherein the AαD-O microorganism has a
mutation in a nucleic acid sequence encoding a pantothenate biosynthetic enzyme
selected from the group consisting of ketopantoate hydroxymethyltransferase,
ketopantoate reductase and pantothenate synthetase.

38. A method of producing β-alanine comprising contacting a composition
comprising aspartate with an isolated Bacillus aspartate-α-decarboxylase enzyme under
conditions such that β-alanine is produced.

39. A method for enhancing production of a panto-compound comprising
culturing a mutant microorganism having a mutant coaX gene under conditions such that
the panto-compound production is enhanced.

40. The method of claim 39, wherein said recombinant microorganism has a
mutant coaA gene.

41. A method of producing a panto-compound comprising a pantothenate
kinase mutant microorganism under conditions such that the panto-compound is
produced at a significantly high yield.

42. The method of claim 41, wherein said mutant microorganism has a
mutant coaA gene.

43. The method of claim 41, wherein said mutant microorganism has a
mutant coaX gene.

44. The method of claim 41, where said mutant microorganism has a mutant
coaA and coaX gene.

45. The method of any one of claims 39 to 44, wherein said panto-compound
is selected from the group consisting of ketopantoate, pantoate or pantothenate.

46. The method of any one of claims 39 to 44, wherein said panto-compound
is pantothenate.

47. The method of any one of claims 39 to 44, wherein said panto-compound
is produced at a level selected from the group consisting of a level greater than 10 g/L, a
level greater than 20 g/L and a level greater than 40 g/L.

48. The method of any one of claims 39 to 44, wherein said recombinant
microorganism further has a deregulated pantothenate biosynthetic pathway or further
has a deregulated isoleucine-valine (ilv) biosynthetic pathway.

49. The method of any one of claims 39 to 44, wherein said
recombinant microorganism further overexpresses panD and panE.

50. The method of any one of claims 39 to 44, wherein said recombinant
microorganism further has at least one mutant gene selected from the group consisting
of a mutant avtA gene, a mutant ilvE gene, a mutant ansB gene and a mutant alsD gene.

51. A method for enhancing production of a panto-compound comprising
culturing a microorganism that has a deregulated pantothenate biosynthetic pathway and
that also has a mutation that results in reduced pantothenate kinase activity under
conditions such that the panto-compound production is enhanced.

52. A method for identifying compounds which modulate pantothenate
kinase activity comprising contacting a recombinant cell expressing pantothenate kinase
encoded by the coaX gene with a test compound and determining the ability of the test
compound to modulate pantothenate kinase activity in said cell.

53. The method of claim 52, wherein said cell further comprises a mutant
coaA gene encoding a pantothenate kinase having reduced activity.

54. The method of any one of claims 1 to 51, wherein the microorganism is
Gram positive.

55. The method of any one of claims 1 to 51, wherein the microorganism is
Gram negative.

56. The method of any one of claims 1 to 51, wherein the microorganism is a
microorganism belonging to a genus selected from the group consisting of Bacillus,
Corynebacterium, Lactobacillus, Lactococci and Streptomyces.

57. The method of any one of claims 1 to 51 and 54 to 56, wherein the
microorganism is of the genus Bacillus.

58. The method of any one of claims 1 to 51 and 54 to 57, wherein the
microorganism is Bacillus subtilis.

59. The method of any one of claims 1 to 13, 35, 39 to 51 and 54 to 58,
further comprising recovering the panto-compound.

60. The method of any one of claims 14 to 34 and 54 to 58, further
comprising recovering the pantothenate.

61. The method of any one of claims 1 to 14, 35, 39 to 46, 48 to 51 and 54 to
59, wherein the panto-compound is produced at a level greater than 2 g/L.

62. A recombinant microorganism which overexpresses at least one Bacillus
pantothenate biosynthetic enzyme.

63. The recombinant microorganism of claim 62, which overexpresses at
least one Bacillus subtilis pantothenate biosynthetic enzyme.

64. The recombinant microorganism of claim 62 or 63, wherein the
pantothenate biosynthetic enzyme is selected from the group consisting of ketopantoate
hydroxymethyltransferase, pantothenate synthetase, aspartate-α-decarboxylase and
ketopantoate reductase.

65. The recombinant microorganism of any one of claims 62 to 64, wherein
the pantothenate biosynthetic enzyme is ketopantoate reductase.

66. A recombinant microorganism which overexpresses
aspartate-α-decarboxylase and has a deregulated isoleucine-valine (ilv) biosynthetic pathway.

67. A recombinant microorganism having a mutant coaX gene, said mutant
coaX gene encoding reduced pantothenate kinase activity in said microorganism.

68. The recombinant microorganism of claim 67 further having a mutant
coaA gene, said mutant coaA gene encoding reduced pantothenate kinase activity in said
microorganism.

69. A recombinant microorganism having a mutant coaX gene and optionally
having a mutant coaA gene, said mutant microorganism having reduced pantothenate
kinase activity as compared to a microorganism having wild-type coaA and coaX genes.

70. A recombinant microorganism comprising a vector comprising an
isolated coaX gene.

71. A recombinant microorganism that overproduces a panto-compound, the
microorganism having a deregulated pantothenate biosynthetic pathway and having at
least one mutation that results in a decrease in the capacity of the microorganism to
synthesize Coenzyme A (CoA).

72. The recombinant microorganism of claim 71, having at least one
mutation that results in a reduced level of pantothenate kinase activity.

73. The recombinant microorganism of claim 72, having a mutation in a
coaA gene, or homologue thereof, that results in a reduced level of CoaA enzyme
activity.

74. The recombinant microorganism of claim 72, having a mutation in a
coaX gene, or homologue thereof, that results in a reduced level of CoaX enzyme
activity.

75. The recombinant microorganism of claim 72, having a mutation in a
coaA gene, or homologue thereof, and having a mutation in a coaX gene, or homologue
thereof, the mutations resulting in reduced levels of CoaA enzyme activity and reduced
CoaX enzyme activity.

76. The recombinant microorganism of any one of claims 66 to 70 which
further has a deregulated pantothenate biosynthetic pathway.

77. The recombinant microorganism of any one of claims 62 to 65 and 67 to
75, further having a deregulated isoleucine-valine (ilv) biosynthetic pathway.

78. The recombinant microorganism of any one of claims 62 to 77, which is
Gram positive.

79. The recombinant microorganism of claim 78 belonging to a genus
selected from the group consisting of Bacillus, Corynebacterium, Lactobacillus,
Lactococci and Streptomyces.

80. The recombinant microorganism of claim 79 belonging to the genus
Bacillus.

81. The recombinant microorganism of claim 80 which is Bacillus subtilis.

82. A recombinant microorganism selected from the group consisting of
PA221, PA235, PA236, PA313, PA410, PA402, PA403, PA411, PA412, PA413,
PA303, PA327, PA328, PA401, PA340, PA342, PA404, PA405, PA374, PA354,
PA365, PA377, PA651 and PA824.

83. A recombinant vector for use in the production of panto-compounds
comprising a nucleic acid sequence which encodes at least one Bacillus pantothenate
biosynthetic enzyme operably linked to regulatory sequences.

84. The vector of claim 83, comprising a nucleic acid sequence which
encodes at least one Bacillus subtilis pantothenate biosynthetic enzyme.

85. The vector of claim 84, wherein the nucleic acid sequence encodes at
least one of ketopantoate hydroxymethyltransferase, pantothenate synthetase,
aspartate-α-decarboxylase and ketopantoate reductase.



86. A recombinant vector comprising at least one nucleic acid sequence
selected from the group consisting of SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27,
SEQ ID NO:29 and SEQ ID NO:59.

87. The vector of claim 84, wherein the nucleic acid sequence encodes
ketopantoate reductase.

88. A vector comprising a mutant coaX gene, said mutant encoding a
pantothenate kinase enzyme having reduced activity.

89. A vector comprising an isolated coaX gene.

90. A vector comprising an isolated Bacillus coaX gene.

91. A vector comprising an isolated Bacillus subtilis coaX gene.

92. The vector of any one of claims 86 and 89 to 91, which further comprises
regulatory sequences.

93. The vector of any one of claims 83 to 85, 87 and 92, wherein the
regulatory sequences comprise a constitutively active promoter.

94. The vector of claim 93, wherein the constitutively active promoter
comprises Pveg (SEQ ID NO:41), P15 (SEQ ID NO:39) or P26 (SEQ ID NO:40)
sequences.

95. The vector of claim 83, wherein the regulatory sequences comprise at
least one artificial ribosome binding site (RBS).

96. The vector of claim 95, wherein the artificial RBS comprises a nucleotide
sequence selected from the group consisting of SEQ ID NO:49, SEQ ID NO:50, SEQ ID
NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID
NO:56 and SEQ ID NO:57.

97. A vector selected from the group consisting of pAN004, pAN005,
pAN006, pAN236, pAN423, pAN428, pAN429, pAN441, pAN442, pAN443, pAN251,
pAN267, pAN256, pAN257, pAN263, pAN240, pAN294, pAN296, pAN336, pAN341
and pAN342.

98. A recombinant microorganism comprising the vector of claim 86 or 93.

99. An isolated nucleic acid molecule which encodes at least one Bacillus
pantothenate biosynthetic gene.

100. The isolated nucleic acid molecule of claim 99 which encodes at least
one Bacillus subtilis pantothenate biosynthetic gene.

101. The isolated nucleic acid molecule of claim 99 or 100 which encodes
ketopantoate reductase.

102. An isolated Bacillus pantothenate biosynthetic enzyme polypeptide.

103. An isolated Bacillus subtilis pantothenate biosynthetic enzyme
polypeptide.

104. An isolated Bacillus ketopantoate reductase polypeptide.

105. An isolated Bacillus subtilis ketopantoate reductase polypeptide.

106. An isolated Bacillus aspartate-α-decarboxylase polypeptide.

107. An isolated Bacillus subtilis aspartate-α-decarboxylase polypeptide.

108. An isolated nucleic acid molecule comprising a mutant coaX gene. 
25 1 09. An isolated nucleic acid molecule comprising a coaX gene. 

110. An isolated pantothenate kinase protein encoded by a coaX gene.

Figure 2. Plasmid pAN240, containing sequences ligated upstream of
the P26 panBCD cassette, equivalent to the integrated version in strain
PA221.

Figure. Plasmid pAN004, containing the panBCD operon expressed from P26 and RBS1.

Figure. Plasmid pAN006, containing the panBCD operon expressed from P26 and RBS2.

Figure. Plasmid pAN236, containing an integratable and amplifiable P26-RBS2-panE1
expression cassette.

Figure 5. Construction of pAN423.

Figure. Construction of pAN426 and pAN427.



[Figure labels: pAN420 × (NsiI + XbaI) [1161 bp]; pAN420 × (NsiI + BsiWI) [2393 bp]; SwaI; PacI; NDA oligo cassette; NDB oligo cassette]

Figure 7. Construction of pAN428 and pAN429.

Figure 8. Construction of pASJJL.



[Figure labels: BamHI; HindIII; SwaI; HindIII; BspEI; BsiWI; panC; NDIA PCR product × XbaI [902 bp]; pASK-IBA3 × [XbaI + SmaI] [3169 bp]; T4 DNA ligase; HindIII]

Figure 9. Construction of pAN441.



[Figure labels: XbaI; SwaI; BglII; BspEI; pAN431 × [NsiI + BsiWI] [2055 bp]; T4 DNA ligase; pAN420 × [NsiI + BsiWI] [2395 bp]]

Figure 10. Construction of pA\44Q.

Figure 12. Structure of pAN267, a plasmid designed to stably integrate a P26 ilvBNC
cassette at the amyE locus.

Figure 13. Structure of pAN257, a clone of B. subtilis ilvD in a low copy vector.

Figure. Structure of pAN263, designed to stably integrate a single copy of
P26 ilvD at the ilvD locus.

Figure. Structure of pAN296, designed to delete most of the B. subtilis coaA gene and substitute a
chloramphenicol resistance gene.

Figure. Structure of pAN281, a plasmid for expressing B. subtilis coaA after integration
at the bpr locus. pAN282 and pAN283 have similar structures.

Figure. Structure of pAN336, a plasmid designed to delete B. subtilis coaX from the
chromosome and replace it with a kanamycin resistance gene.

SEQUENCE LISTING

<110> OMNIGENE BIOPRODUCTS 

<120> METHODS AND MICROORGANISMS FOR PRODUCTION OF
PANTO-COMPOUNDS

<130> BGI-141CPPC

<140> 
<141> 

<150> USSN 09/400,494 
<151> 1999-09-21 

<150> USSN 60/210,072 
<151> 2000-06-07 

<150> USSN 60/221,836 
<151> 2000-07-28 

<150> USSN 60/227,860
<151> 2000-08-24 

<160> 94 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 311 
<212> PRT 

<213> Haemophilus influenzae 
<400> 1 

Met Glu Phe Ser Thr Gln Gln Thr Pro Phe Leu Ser Phe Asn Arg Glu
1 5 10 15

Gin Trp Ala Glu Leu Arg Lys Ser Val Pro Leu Lys Leu Thr Glu Gin 
20 25 30 

Asp Leu Lys Pro Leu Leu Gly Phe Asn Glu Asp Leu Ser Leu Asp Glu 
35 40 45 

Val Ser Thr lie Tyr Leu Pro Leu Thr Arg Leu lie Asn Tyr Tyr lie 
50 55 60 

Asp Glu Asn Leu His Arg Gin Thr Val Leu His Arg Phe Leu Gly Arg 
65 70 75 80 

Asn Asn Ala Lys Thr Pro Tyr lie lie Ser lie Ala Gly Ser Val Ala 
85 90 95 

Val Gly Lys Ser Thr Ser Ala Arg lie Leu Gin Ser Leu Leu Ser His 
100 105 110 

Trp Pro Thr Glu Arg Lys Val Asp Leu Ile Thr Thr Asp Gly Phe Leu
115 120 125

Tyr Pro Leu Asn Lys Leu Lys Gln Asp Asn Leu Leu Gln Lys Lys Gly
130 135 140

Phe Pro Val Ser Tyr Asp Thr Pro Lys Leu lie Arg Phe Leu Ala Asp 
145 150 155 160 

Val Lys Ser Gly Lys Ser Asn Val Thr Ala Pro He Tyr Ser His Leu 
165 170 175 

Thr Tyr Asp He He Pro Asp Lys Phe Asp Val Val Asp Lys Pro Asp 
180 185 190 

He Leu He Leu Glu Gly Leu Asn Val Leu Gin Thr Gly Asn Asn Lys 
195 200 205 

Thr Asp Gin Thr Phe Val Ser Asp Phe Val Asp Phe Ser He Tyr Val 
210 215 220 

Asp Ala Glu Glu Lys Leu Leu Lys Glu Trp Tyr lie Lys Arg Phe Leu 
225 230 235 240 

Lys Phe Arg Glu Ser Ala Phe Asn Asp Pro Asn Ser Tyr Phe Lys His 
245 250 255 

Tyr Ala Ser Leu Ser Lys Glu Glu Ala He Ala Thr Ala Ser Lys He 
260 265 270 

Trp Asp Glu He Asn Gly Leu Asn Leu Asn Gin Asn He Leu Pro Thr 
275 280 285 

Arg Glu Arg Ala Asn Leu Ile Leu Lys Lys Gly His Asn His Gln Val
290 295 300

Glu Leu lie Lys Leu Arg Lys 
305 310 



<210> 2 
<211> 316 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Met Ser Ile Lys Glu Gln Thr Leu Met Thr Pro Tyr Leu Gln Phe Asp
1 5 10 15

Arg Asn Gln Trp Ala Ala Leu Arg Asp Ser Val Pro Met Thr Leu Ser
20 25 30

Glu Asp Glu He Ala Arg Leu Lys Gly He Asn Glu Asp Leu Ser Leu 
35 40 45 

Glu Glu Val Ala Glu He Tyr Leu Pro Leu Ser Arg Leu Leu Asn Phe 
50 55 60 

Tyr Ile Ser Ser Asn Leu Arg Arg Gln Ala Val Leu Glu Gln Phe Leu
65 70 75 80

Gly Thr Asn Gly Gln Arg Ile Pro Tyr Ile Ile Ser Ile Ala Gly Ser
85 90 95

Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gin Ala Leu Leu 
100 105 110 

Ser Arg Trp Pro Glu His Arg Arg Val Glu Leu He Thr Thr Asp Gly 
115 120 125 

Phe Leu His Pro Asn Gin Val Leu Lys Glu Arg Gly Leu Met Lys Lys 
130 135 140 

Lys Gly Phe Pro Glu Ser Tyr Asp Met His Arg Leu Val Lys Phe Val 
145 150 155 160 

Ser Asp Leu Lys Ser Gly Val Pro Asn Val Thr Ala Pro Val Tyr Ser 
165 170 175 

His Leu He Tyr Asp Val He Pro Asp Gly Asp Lys Thr Val Val Gin 
180 185 190 

Pro Asp He Leu lie Leu Glu Gly Leu Asn Val Leu Gin Ser Gly Met 
195 200 205 

Asp Tyr Pro His Asp Pro His His Val Phe Val Ser Asp Phe Val Asp 
210 215 220 

Phe Ser Ile Tyr Val Asp Ala Pro Glu Asp Leu Leu Gln Thr Trp Tyr
225 230 235 240 

He Asn Arg Phe Leu Lys Phe Arg Glu Gly Ala Phe Thr Asp Pro Asp 
245 250 255 

Ser Tyr Phe His Asn Tyr Ala Lys Leu Thr Lys Glu Glu Ala lie Lys 
260 265 270 

Thr Ala Met Thr Leu Trp Lys Glu lie Asn Trp Leu Asn Leu Lys Gin 
275 280 285 

Asn lie Leu Pro Thr Arg Glu Arg Ala Ser Leu lie Leu Thr Lys Ser 
290 295 300 

Ala Asn His Ala Val Glu Glu Val Arg Leu Arg Lys 
305 310 315 



<210> 3 
<211> 319 
<212> PRT 

<213> Bacillus subtilis 
<400> 3 

Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gln His Asn
1 5 10 15

Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser lie Ala Val Ser 
20 25 30 



Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp Tyr Leu Ser Val 
35 40 45 

Glu Glu Val Glu Thr lie Tyr lie Pro Leu Val Arg Leu Leu His Leu 
50 55 60 

His Val Lys Ser Ala Ala Glu Arg Asn Lys His Val Asn Val Phe Leu 
65 70 75 80 

Lys His Pro His Ser Ala Lys lie Pro Phe lie lie Gly lie Ala Gly 
85 90 95 

Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg lie Leu Gin Lys Leu 
100 105 110 

Leu Ser Arg Leu Pro Asp Arg Pro Lys Val Ser Leu lie Thr Thr Asp 
115 120 125 

Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys Lys Lys Asn Met Met Ser 
130 135 140 

Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe 
145 150 155 160 

Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr 
165 170 175

Ser His Leu Thr Tyr Asp Arg Glu Glu Gly Val Phe Glu Val Val Glu 
180 185 190 

Gin Ala Asp lie Val He He Glu Gly He Asn Val Leu Gin Ser Pro 
195 200 205 

Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg He Phe Val Ser Asp Phe 
210 215 220 

Phe Asp Phe Ser He Tyr Val Asp Ala Glu Glu Ser Arg lie Phe Thr 
225 230 235 240 

Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn 
245 250 255 

Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala 
260 265 270 

Asp Glu Met Ala Ala Ser He Trp Glu Ser Val Asn Arg Pro Asn Leu 
275 280 285 

Tyr Glu Asn He Leu Pro Thr Lys Phe Arg Ser Asp Leu lie Leu Arg 
290 295 300 



Lys Gly Asp Gly His Lys Val Glu Glu Val Leu Val Arg Arg Val 
305 310 315 



<210> 4 
<211> 312 
<212> PRT 



<213> Mycobacterium leprae
<400> 4

Met Pro Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp Arg Lys
1 5 10 15

Gin Trp Arg Ala Leu Arg Met Ser Thr Pro Leu Ala Leu Thr Glu Glu 
20 25 30 

Glu Leu He Gly Leu Arg Gly Leu Gly Glu Gin He Asp Leu Leu Glu 
35 40 45 

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu He His Leu Gin Val 
50 55 60 

Ala Ala Arg Gin Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu 
65 70 75 80 

Pro Gin Gin Asn Pro Gly Arg Pro Val Pro Phe He lie Gly Val Ala 
85 90 95 

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gin Ala 
100 105 110 

Leu Leu Ala Arg Trp Asp His His Thr Arg Val Asp Leu Val Thr Thr 
115 120 125 

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gly Arg Arg Asn Leu Met 
130 135 140 

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg 
145 150 155 160 

Phe Val Thr Ser Val Lys Ser Gly Ala Asp Tyr Ala Cys Ala Pro Val 
165 170 175 

Tyr Ser His Leu Arg Tyr Asp Thr He Pro Gly Ala Lys His Val Val 
180 185 190 

Arg His Pro Asp He Leu He Leu Glu Gly Leu Asn Val Leu Gin Thr 
195 200 205 

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val 
210 215 220 

Asp Ala Arg He Gin Asp He Glu Gin Trp Tyr Val Ser Arg Phe Leu 
225 230 235 240 

Ala Met Arg Gly Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His 
245 250 255 

Tyr Ser Ala Leu Thr Asp Ser Lys Ala He He Ala Ala Arg Glu He 
260 265 270 

Trp Arg Ser lie Asn Arg Pro Asn Leu Val Glu Asn lie Leu Pro Thr 
275 280 285 

Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser Ile
290 295 300

Asn Arg Leu Arg Leu Arg Lys Leu
305 310



<210> 5 
<211> 312 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 5 

Met Ser Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp Arg Arg 
1 5 10 15

Gin Trp Arg Ala Leu Arg Met Ser Thr Pro Leu Ala Leu Thr Glu Glu 
20 25 30 

Glu Leu Val Gly Leu Arg Gly Leu Gly Glu Gin He Asp Leu Leu Glu 
35 40 45 

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu He His Leu Gin Val 
50 55 60 

Ala Ala Arg Gin Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu 
65 70 75 80

Pro Gin Gin Asn Pro Asp Arg Pro Val Pro Phe He He Gly Val Ala 
85 90 95 

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gln Ala
100 105 110 

Leu Leu Ala Arg Trp Asp His His Pro Arg Val Asp Leu Val Thr Thr 
115 120 125 

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gin Arg Arg Asn Leu Met 
130 135 140 

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg 
145 150 155 160

Phe Val Thr Ser Val Lys Ser Gly Ser Asp Tyr Ala Cys Ala Pro Val 
165 170 175 

Tyr Ser His Leu His Tyr Asp He lie Pro Gly Ala Glu Gin Val Val 
180 185 190 

Arg His Pro Asp He Leu He Leu Glu Gly Leu Asn Val Leu Gin Thr 
195 200 205 

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val 
210 215 220 

Asp Ala Arg He Glu Asp He Glu Gin Trp Tyr Val Ser Arg Phe Leu 
225 230 235 240 

Ala Met Arg Thr Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His
245 250 255

Tyr Ala Ala Phe Ser Asp Ser Gln Ala Val Val Ala Ala Arg Glu Ile
260 265 270

Trp Arg Thr Ile Asn Arg Pro Asn Leu Val Glu Asn Ile Leu Pro Thr
275 280 285

Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser Ile
290 295 300

Asn Arg Leu Arg Leu Arg Lys Leu
305 310



<210> 6 
<211> 329 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 6 

Met Ile Ser Pro Val Pro Ser Ile Pro Arg Ser Ala His Arg Gln Arg
1 5 10 15

Pro Glu Ala Thr Pro Tyr Val Asp Leu Thr Arg Pro Glu Trp Ser Ala 
20 25 30 

Leu Arg Asp Lys Thr Pro Leu Pro Leu Thr Ala Glu Glu Val Glu Lys 
35 40 45

Leu Arg Gly Leu Gly Asp Val Ile Asp Leu Asp Glu Val Arg Asp Ile
50 55 60

Tyr Leu Pro Leu Ser Arg Leu Leu Asn Leu Tyr Val Gly Ala Thr Asp 
65 70 75 80 

Gly Leu Arg Gly Ala Leu Asn Thr Phe Leu Gly Glu Gin Gly Ser Gin 
85 90 95 

Ser Gly Thr Pro Phe Val lie Gly Val Ala Gly Ser Val Ala Val Gly 
100 105 110 

Lys Ser Thr Val Ala Arg Leu Leu Gin Ala Leu Leu Ser Arg Trp Pro 
115 120 125 

Glu His Pro Arg Val Glu Leu Val Thr Thr Asp Gly Phe Leu Leu Pro 
130 135 140 

Thr Arg Glu Leu Glu Ala Arg Gly Leu Met Ser Arg Lys Gly Phe Pro 
145 150 155 160 

Glu Ser Tyr Asp Arg Arg Ala Leu Thr Arg Phe Val Ala Asp He Lys 
165 170 175 

Ala Gly Lys Ala Glu Val Thr Ala Pro Val Tyr Ser His Leu He Tyr 
180 185 190 

Asp Ile Val Pro Asp Gln Arg Leu Val Val Arg Arg Pro Asp Ile Leu
195 200 205

Ile Val Glu Gly Leu Asn Val Leu Gln Pro Ala Leu Pro Gly Lys Asp
210 215 220

Gly Arg Thr Arg Val Gly Leu Ala Asp Tyr Phe Asp Phe Ser Val Tyr
225 230 235 240

Val Asp Ala Arg Thr Glu Asp Ile Glu Arg Trp Tyr Leu Asn Arg Phe
245 250 255

Arg Lys Leu Arg Ala Thr Ala Phe Gln Asn Pro Ser Ser Tyr Phe Arg
260 265 270

Lys Tyr Thr Gln Val Ser Glu Glu Glu Ala Leu Asp Tyr Ala Arg Thr
275 280 285

Thr Trp Arg Thr Ile Asn Lys Pro Asn Leu Val Glu Asn Val Ala Pro
290 295 300

Thr Arg Gly Arg Ala Thr Leu Val Leu Arg Lys Gly Pro Asp His Lys
305 310 315 320

Val Gln Arg Leu Ser Leu Arg Lys Leu
325



<210> 7 

<211> 265 

<212> PRT 

<213> Streptomyces coelicolor 

<400> 7 

Met Leu Leu Thr Ile Asp Val Gly Asn Thr His Thr Val Leu Gly Leu
1 5 10 15

Phe Asp Gly Glu Asp lie Val Glu His Trp Arg lie Ser Thr Asp Ser 
20 25 30 

Arg Arg Thr Ala Asp Glu Leu Ala Val Leu Leu Gin Gly Leu Met Gly 
35 40 45 

Met His Pro Leu Leu Gly Asp Glu Leu Gly Asp Gly He Asp Gly He 
50 55 60 

Ala He Cys Ala Thr Val Pro Ser Val Leu His Glu Leu Arg Glu Val 
65 70 75 80 

Thr Arg Arg Tyr Tyr Gly Asp Val Pro Ala Val Leu Val Glu Pro Gly 
85 90 95 

Val Lys Thr Gly Val Pro He Leu Thr Asp His Pro Lys Glu Val Gly 
100 105 110 

Ala Asp Arg He He Asn Ala Val Ala Ala Val Glu Leu Tyr Gly Gly 
115 120 125 

Pro Ala Ile Val Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Ala Val
130 135 140

Ser Ala Arg Gly Glu Tyr Ile Gly Gly Val Ile Ala Pro Gly Ile Glu
145 150 155 160

Ile Ser Val Glu Ala Leu Gly Val Lys Gly Ala Gln Leu Arg Lys Ile
165 170 175

Glu Val Ala Arg Pro Arg Ser Val Ile Gly Lys Asn Thr Val Glu Ala
180 185 190

Met Gln Ser Gly Ile Val Tyr Gly Phe Ala Gly Gln Val Asp Gly Val
195 200 205

Val Asn Arg Met Ala Arg Glu Leu Ala Asp Asp Pro Asp Asp Val Thr
210 215 220

Val Ile Ala Thr Gly Gly Leu Ala Pro Met Val Leu Gly Glu Ser Ser
225 230 235 240

Val Ile Asp Glu His Glu Pro Trp Leu Thr Leu Met Gly Leu Arg Leu
245 250 255

Val Tyr Glu Arg Asn Val Ser Arg Met
260 265

<210> 8 
<211> 272 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 8 

Met Leu Leu Ala Ile Asp Val Arg Asn Thr His Thr Val Val Gly Leu
1 5 10 15

Leu Ser Gly Met Lys Glu His Ala Lys Val Val Gin Gin Trp Arg He 
20 25 30 

Arg Thr Glu Ser Glu Val Thr Ala Asp Glu Leu Ala Leu Thr He Asp 
35 40 45 

Gly Leu He Gly Glu Asp Ser Glu Arg Leu Thr Gly Thr Ala Ala Leu 
50 55 60 

Ser Thr Val Pro Ser Val Leu His Glu Val Arg Ile Met Leu Asp Gln
65 70 75 80

Tyr Trp Pro Ser Val Pro His Val Leu He Glu Pro Gly Val Arg Thr 
85 90 95 

Gly He Pro Leu Leu Val Asp Asn Pro Lys Glu Val Gly Ala Asp Arg 
100 105 110 

He Val Asn Cys Leu Ala Ala Tyr Asp Arg Phe Arg Lys Ala Ala He 
115 120 125 

Val Val Asp Phe Gly Ser Ser Ile Cys Val Asp Val Val Ser Ala Lys
130 135 140

Gly Glu Phe Leu Gly Gly Ala He Ala Pro Gly Val Gin Val Ser Ser 
145 150 155 160 

Asp Ala Ala Ala Ala Arg Ser Ala Ala Leu Arg Arg Val Glu Leu Ala 
165 170 175 

Arg Pro Arg Ser Val Val Gly Lys Asn Thr Val Glu Cys Met Gin Ala 
180 185 190 

Gly Ala Val Phe Gly Phe Ala Gly Leu Val Asp Gly Leu Val Gly Arg 
195 200 205 

Ile Arg Glu Asp Val Ser Gly Phe Ser Val Asp His Asp Val Ala Ile
210 215 220 

Val Ala Thr Gly His Thr Ala Pro Leu Leu Leu Pro Glu Leu His Thr 
225 230 235 240 

Val Asp His Tyr Asp Gin His Leu Thr Leu Gin Gly Leu Arg Leu Val 
245 250 255 

Phe Glu Arg Asn Leu Glu Val Gln Arg Gly Arg Leu Lys Thr Ala Arg
260 265 270



<210> 9 

<211> 258 

<212> PRT

<213> Bacillus subtilis 

<400> 9 

Leu Leu Leu Val Ile Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val
1 5 10 15

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 
20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin He Asp Gly He He He Ser Ser 
50 55 60 

Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 
100 105 110 

Asn Ala Val Ala Ala Ile His Leu Tyr Gly Asn Pro Leu Ile Val Val
115 120 125

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr Ile Asp Glu Asn Lys Gln
130 135 140

Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 
165 170 175 

Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 

Trp Gin Ala Lys Gin Asp Leu Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 220 

Pro Leu Ile Ala Asn Glu Ser Asp Cys Ile Asp Ile Val Asp Pro Phe
225 230 235 240

Leu Thr Leu Lys Gly Leu Glu Leu lie Tyr Glu Arg Asn Arg Val Gly 
245 250 255 

Ser Val 



<210> 10 
<211> 262 
<212> PRT 

<213> Deinococcus radiopugnans 
<400> 10 

Met Pro Ala Phe Pro Leu Leu Ala Val Asp Ile Gly Asn Thr Thr Thr
1 5 10 15

Val Leu Gly Leu Ala Asp Ala Ser Gly Ala Leu Thr His Thr Trp Arg 
20 25 30 

Ile Arg Thr Asn Arg Glu Met Leu Pro Asp Asp Leu Ala Leu Gln Leu
35 40 45

His Gly Leu Phe Thr Leu Ala Gly Ala Pro lie Pro Arg Ala Ala Val 
50 55 60 

Leu Ser Ser Val Ala Pro Pro Val Gly Glu Asn Tyr Ala Leu Ala Leu 
65 70 75 80

Lys Arg His Phe Met lie Asp Ala Phe Ala Val Ser Ala Glu Asn Leu 
85 90 95 

Pro Asp Val Thr Val Glu Leu Asp Thr Pro Gly Ser Val Gly Ala Asp
100 105 110

Arg Leu Cys Asn Leu Phe Gly Ala Glu Lys Tyr Leu Gly Gly Leu Asp
115 120 125



Tyr Ala Val Val Val Asp Phe Gly Thr Ser Thr Asn Phe Asp Val Val 
130 135 140 

Gly Arg Gly Arg Arg Phe Leu Gly Gly Ile Leu Ala Thr Gly Ala Gln
145 150 155 160

Val Ser Ala Asp Ala Leu Phe Ala Arg Ala Ala Lys Leu Pro Arg lie 
165 170 175 

Thr Leu Gin Ala Pro Glu Thr Ala lie Gly Lys Asn Thr Val His Ala 
180 185 190 

Leu Gin Ser Gly Leu Val Phe Gly Tyr Ala Glu Met Val Asp Gly Leu 
195 200 205 

Leu Arg Arg lie Arg Ala Glu Leu Pro Gly Glu Ala Val Ala Val Ala 
210 215 220 

Thr Gly Gly Phe Ser Arg Thr Val Gin Gly lie Cys Gin Glu He Asp 
225 230 235 240 

Tyr Tyr Asp Glu Thr Leu Thr Leu Arg Gly Leu Val Glu Leu Trp Ala 
245 250 255 

Ser Arg Ser Glu Val Arg
260



<210> 11 
<211> 212 
<212> PRT 

<213> Desulfovibrio vulgaris
<400> 11 

Met Thr Gln His Phe Leu Leu Phe Asp Ile Gly Asn Thr Asn Val Lys
1 5 10 15

He Gly He Ala Val Glu Thr Ala Val Leu Thr Ser Tyr Val Leu Pro 
20 25 30 

Thr Asp Pro Gly Gin Thr Thr Asp Ser lie Gly Leu Arg Leu Leu Glu 
35 40 45 

Val Leu Arg His Ala Gly Leu Gly Pro Ala Asp Val Gly Ala Cys Val 
50 55 60 

Ala Ser Ser Val Val Pro Gly Val Asn Pro Leu He Arg Arg Ala Cys 
65 70 75 80 

Glu Arg Tyr Leu Tyr Arg Lys Leu Leu Phe Ala Pro Gly Asp He Ala 
85 90 95 

Ile Pro Leu Asp Asn Arg Tyr Glu Arg Pro Ala Glu Val Gly Ala Asp
100 105 110

Arg Leu Val Ala Ala Tyr Ala Ala Arg Arg Leu Tyr Pro Gly Pro Arg
115 120 125

Ser Leu Val Ser Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Cys Val
130 135 140

Glu Gly Gly Ala Tyr Leu Gly Gly Leu Ile Cys Pro Gly Val Leu Ser
145 150 155 160

Ser Ala Gly Ala Leu Ser Ser Arg Thr Ala Lys Leu Pro Arg Ile Ser
165 170 175

Leu Glu Val Glu Glu Asp Ser Pro Val Ile Gly Arg Ser Thr Thr Thr
180 185 190

Ser Leu Asn His Gly Phe Ile Phe Gly Phe Ala Ala Met Thr Glu Gly
195 200 205

Val Leu Ala Ala
210



<210> 12 
<211> 246 
<212> PRT 

<213> Thermotoga maritima
<400> 12 

Met Tyr Leu Leu Val Asp Val Gly Asn Thr His Ser Val Phe Ser Ile
1 5 10 15

Thr Glu Asp Gly Lys Thr Phe Arg Arg Trp Arg Leu Ser Thr Gly Val
20 25 30

Phe Gin Thr Glu Asp Glu Leu Phe Ser His Leu His Pro Leu Leu Gly 
35 40 45 

Asp Ala Met Arg Glu He Lys Gly He Gly Val Ala Ser Val Val Pro 
50 55 60 

Thr Gin Asn Thr Val He Glu Arg Phe Ser Gin Lys Tyr Phe His He 
65 70 75 80 

Ser Pro He Trp Val Lys Ala Lys Asn Gly Cys Val Lys Trp Asn Val 
85 90 95 

Lys Asn Pro Ser Glu Val Gly Ala Asp Arg Val Ala Asn Val Val Ala 
100 105 110 

Phe Val Lys Glu Tyr Gly Lys Asn Gly He He He Asp Met Gly Thr 
115 120 125 

Ala Thr Thr Val Asp Leu Val Val Asn Gly Ser Tyr Glu Gly Gly Ala 
130 135 140 

He Leu Pro Gly Phe Phe Met Met Val His Ser Leu Phe Arg Gly Thr 
145 150 155 160 



Ala Lys Leu Pro Leu Val Glu Val Lys Pro Ala Asp Phe Val Val Gly
165 170 175



Lys Asp Thr Glu Glu Asn lie Arg Leu Gly Val Val Asn Gly Ser Val 
180 185 190 

Tyr Ala Leu Glu Gly lie lie Gly Arg lie Lys Glu Val Tyr Gly Asp 
195 200 205 

Leu Pro Val Val Leu Thr Gly Gly Gin Ser Lys lie Val Lys Asp Met 
210 215 220 

lie Lys His Glu lie Phe Asp Glu Asp Leu Thr lie Lys Gly Val Tyr 
225 230 235 240 

His Phe Cys Phe Gly Asp
245



<210> 13 

<211> 273 

<212> PRT 

<213> Treponema pallidum 

<400> 13 

Met Leu Leu lie Asp Val Gly Asn Ser His Val Val Phe Gly He Gin 
1 5 10 15 

Gly Glu Asn Gly Gly Arg Val Cys Val Arg Glu Leu Phe Arg Leu Ala 
20 25 30 

Pro Asp Ala Arg Lys Thr Gin Asp Glu Tyr Ser Leu Leu He His Ala 
35 40 45 

Leu Cys Glu Arg Ala Gly Val Gly Arg Ala Ser Leu Arg Asp Ala Phe 
50 55 60 

He Ser Ser Val Val Pro Val Leu Thr Lys Thr He Ala Asp Ala Val 
65 70 75 80 

Ala Gin He Ser Gly Val Gin Pro Val Val Phe Gly Pro Trp Ala Tyr 
85 90 95 

Glu His Leu Pro Val Arg He Pro Glu Pro Val Arg Ala Glu He Gly 
100 105 110 

Thr Asp Leu Val Ala Asn Ala Val Ala Ala Tyr Val His Phe Arg Ser 
115 120 125 

Ala Cys Val Val Val Asp Cys Gly Thr Ala Leu Thr Phe Thr Ala Val 
130 135 140 

Asp Gly Thr Gly Leu He Gin Gly Val Ala He Ala Pro Gly Leu Arg 
145 150 155 160 

Thr Ala Val Gln Ser Leu His Thr Gly Thr Ala Gln Leu Pro Leu Val
165 170 175

Pro Leu Ala Leu Pro Asp Ser Val Leu Gly Lys Asp Thr Thr His Ala
180 185 190

Val Gln Ala Gly Val Val Arg Gly Thr Leu Phe Val Ile Arg Ala Met
195 200 205

Ile Ala Gln Cys Gln Lys Glu Leu Gly Cys Arg Cys Ala Ala Val Ile
210 215 220

Thr Gly Gly Leu Ser Arg Leu Phe Ser Ser Glu Val Asp Phe Pro Pro
225 230 235 240

Ile Asp Ala Gln Leu Thr Leu Ser Gly Leu Ala His Ile Ala Arg Leu
245 250 255

Val Pro Thr Ser Leu Leu Pro Pro Ala Thr Val Ser Gly Ser Ser Gly
260 265 270

Asn



<210> 14 
<211> 262 
<212> PRT 

<213> Borrelia burgdorferi 
<400> 14 

Met Asn Lys Pro Leu Leu Ser Glu Leu Ile Ile Asp Ile Gly Asn Thr
1 5 10 15

Ser Ile Ala Phe Ala Leu Phe Lys Asp Asn Gln Val Asn Leu Phe Ile
20 25 30

Lys Met Lys Thr Asn Leu Met Leu Arg Tyr Asp Glu Val Tyr Ser Phe
35 40 45

Phe Glu Glu Asn Phe Asp Phe Asn Val Asn Lys Val Phe lie Ser Ser 
50 55 60 

Val Val Pro lie Leu Asn Glu Thr Phe Lys Asn Val lie Phe Ser Phe 
65 70 75 80 

Phe Lys lie Lys Pro Leu Phe lie Gly Phe Asp Leu Asn Tyr Asp Leu 
85 90 95 

Thr Phe Asn Pro Tyr Lys Ser Asp Lys Phe Leu Leu Gly Ser Asp Val 
100 105 110 

Phe Ala Asn Leu Val Ala Ala lie Glu Asn Tyr Ser Phe Glu Asn Val 
115 120 125 

Leu Val Val Asp Leu Gly Thr Ala Cys Thr He Phe Ala Val Ser Arg 
130 135 140 

Gin Asp Gly He Leu Gly Gly He He Asn Ser Gly Pro Leu He Asn 
145 150 155 160 

Phe Asn Ser Leu Leu Asp Asn Ala Tyr Leu Ile Lys Lys Phe Pro Ile
165 170 175



Ser Thr Pro Asn Asn Leu Leu Glu Arg Thr Thr Ser Gly Ser Val Asn 
180 185 190 

Ser Gly Leu Phe Tyr Gin Tyr Lys Tyr Leu lie Glu Gly Val Tyr Arg 
195 200 205 

Asp Ile Lys Gln Met Tyr Lys Lys Lys Phe Asn Leu Ile Ile Thr Gly
210 215 220

Gly Asn Ala Asp Leu Ile Leu Ser Leu Ile Glu Ile Glu Phe Ile Phe
225 230 235 240

Asn Ile His Leu Thr Val Glu Gly Val Arg Ile Leu Gly Asn Ser Ile
245 250 255

Asp Phe Lys Phe Val Asn
260



<210> 15 
<211> 229 
<212> PRT 

<213> Aquifex aeolicus 
<400> 15 

Met Arg Phe Leu Thr Val Asp Val Gly Asn Ser Ser Val Asp Ile Ala
1 5 10 15

Leu Trp Glu Gly Lys Lys Val Lys Asp Phe Leu Lys Leu Ser His Glu
20 25 30

Glu Phe Leu Lys Glu Glu Phe Pro Lys Leu Lys Ala Leu Gly Ile Ser
35 40 45

Val Lys Gln Ser Phe Ser Glu Lys Val Arg Gly Lys Ile Pro Lys Ile
50 55 60

Lys Phe Leu Lys Lys Glu Asn Phe Pro Ile Gln Val Asp Tyr Lys Thr
65 70 75 80

Pro Glu Thr Leu Gly Thr Asp Arg Val Ala Leu Ala Tyr Ser Ala Lys
85 90 95

Lys Phe Tyr Gly Lys Asn Val Val Val Ile Ser Ala Gly Thr Ala Leu
100 105 110

Val Ile Asp Leu Val Leu Glu Gly Lys Phe Lys Gly Gly Phe Ile Thr
115 120 125

Leu Gly Leu Gly Lys Lys Leu Lys Ile Leu Ser Asp Leu Ala Glu Gly
130 135 140

Ile Pro Glu Phe Phe Pro Glu Glu Val Glu Ile Phe Leu Gly Arg Ser
145 150 155 160

Thr Arg Glu Cys Val Leu Gly Gly Ala Tyr Arg Glu Ser Thr Glu Phe
165 170 175

lie Lys Ser Thr Leu Lys Leu Trp Arg Lys Val Phe Lys Arg Lys Phe 
180 185 190 

Lys Val Val He Thr Gly Gly Glu Gly Lys Tyr Phe Ser Lys Phe Gly 
195 200 205 

Ile Tyr Asp Pro Leu Leu Val His Arg Gly Met Arg Asn Leu Leu Tyr
210 215 220

Leu Tyr His Arg He 
225 



<210> 16 
<211> 257 
<212> PRT 

<213> Synechocystis sp. 
<400> 16 

Met Glu Thr Ser Lys Pro Gly Cys Gly Leu Ala Leu Asp Asn Asp Lys 
1 5 10 15 

Gin Lys Pro Trp Leu Gly Leu Met He Gly Asn Ser Arg Leu His Trp 
20 25 30 

Ala Tyr Cys Ser Gly Asn Ala Pro Leu Gin Thr Trp Val Thr Asp Tyr 
35 40 45 

Asn Pro Lys Ser Ala Gin Leu Pro Val Leu Leu Gly Lys Val Pro Leu 
50 55 60 

Met Leu Ala Ser Val Val Pro Glu Gin Thr Glu Val Trp Arg Val Tyr 
65 70 75 80 

Gin Pro Lys lie Leu Thr Leu Lys Asn Leu Pro Leu Val Asn Leu Tyr 
85 90 95 

Pro Ser Phe Gly He Asp Arg Ala Leu Ala Gly Leu Gly Thr Gly Leu 
100 105 110 

Thr Tyr Gly Phe Pro Cys Leu Val Val Asp Gly Gly Thr Ala Leu Thr 
115 120 125 

He Thr Gly Phe Asp Gin Asp Lys Lys Leu Val Gly Gly Ala He Leu 
130 135 140 

Pro Gly Leu Gly Leu Gin Leu Ala Thr Leu Gly Asp Arg Leu Ala Ala 
145 150 155 160 

Leu Pro Lys Leu Glu Met Asp Gin Leu Thr Glu Leu Pro Asp Arg Trp 
165 170 175 

Ala Leu Asp Thr Pro Ser Ala He Phe Ser Gly Val Val Tyr Gly Val 
180 185 190 



Leu Gly Ala Leu Gln Ser Tyr Leu Gln Asp Trp Gln Lys Leu Phe Pro
195 200 205

Gly Ala Ala Met Val Ile Thr Gly Gly Asp Gly Lys Ile Leu His Gly
210 215 220

Phe Leu Lys Glu His Ser Pro Asn Leu Ser Val Ala Trp Asp Asp Asn
225 230 235 240

Leu Ile Phe Leu Gly Met Ala Ala Ile His His Gly Asp Arg Pro Ile
245 250 255

Cys



<210> 17 

<211> 223 

<212> PRT 

<213> Helicobacter pylori 

<400> 17 

Met Pro Ala Arg Gln Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys
1 5 10 15

Asp He Gly Asn Thr Arg He His Phe Ala Gin Asn Tyr Gin Leu Phe 
20 25 30 

Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu Gly He Gin Lys Glu He 
35 40 45 

Phe Tyr He Ser Val Asn Glu Glu Asn Glu Lys Ala Leu Leu Asn Cys 
50 55 60 

Tyr Pro Asn Ala Lys Asn He Ala Gly Phe Phe His Leu Glu Thr Asp 
65 70 75 80 

Tyr Val Gly Leu Gly He Asp Arg Gin Met Ala Cys Leu Ala Val Asn 
85 90 95 

Asn Gly Val Val Val Asp Ala Gly Ser Ala Ile Thr Ile Asp Leu Ile
100 105 110

Lys Glu Gly Lys His Leu Gly Gly Cys lie Leu Pro Gly Leu Ala Gin 
115 120 125 

Tyr Ile His Ala Tyr Lys Lys Ser Ala Lys Ile Leu Glu Gln Pro Phe
130 135 140

Lys Ala Leu Asp Ser Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala 
145 150 155 160 

Val Asn Tyr Gly Met Val Leu Ser Val lie Ala Cys lie Gin His Leu 
165 170 175 

Ala Lys Asn Gin Lys lie Tyr Leu Cys Gly Gly Asp Ala Lys Tyr Leu 
180 185 190 



Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp
195 200 205



Gly Met Glu lie Ala Leu Lys Lys Ala Gly lie Leu Glu Cys Lys 
210 215 220 



<210> 18 
<211> 267 
<212> PRT 

<213> Bordetella pertussis 
<400> 18 

Met Ile Ile Leu Ile Asp Ser Gly Asn Ser Arg Leu Lys Val Gly Trp
1 5 10 15

Phe Asp Pro Asp Ala Pro Gin Ala Ala Arg Glu Pro Ala Pro Val Ala 
20 25 30 

Phe Asp Asn Leu Asp Leu Asp Ala Leu Gly Arg Trp Leu Ala Thr Leu 
35 40 45 

Pro Arg Arg Pro Gin Arg Ala Leu Gly Val Asn Val Ala Gly Leu Ala 
50 55 60 

Arg Gly Glu Ala lie Ala Ala Thr Leu Arg Ala Gly Gly Cys Asp lie 
65 70 75 80 

Arg Trp Leu Arg Ala Gin Pro Leu Ala Met Gly Leu Arg Asn Gly Tyr 
85 90 95 

Arg Asn Pro Asp Gin Leu Gly Ala Asp Arg Trp Ala Cys Met Val Gly 
100 105 110 

Val Leu Ala Arg Gin Pro Ser Val His Pro Pro Leu Leu Val Ala Ser 
115 120 125 

Phe Gly Thr Ala Thr Thr Leu Asp Thr lie Gly Pro Asp Asn Val Phe 
130 135 140 

Pro Gly Gly Leu Ile Leu Pro Gly Pro Ala Met Met Arg Gly Ala Leu
145 150 155 160

Ala Tyr Gly Thr Ala His Leu Pro Leu Ala Asp Gly Leu Val Ala Asp 
165 170 175 

Tyr Pro lie Asp Thr His Gin Ala lie Ala Ser Gly lie Ala Ala Ala 
180 185 190 

Gin Ala Gly Ala lie Val Arg Gin Trp Leu Ala Gly Arg Gin Arg Tyr 
195 200 205 

Gly Gin Ala Pro Glu lie Tyr Val Ala Gly Gly Gly Trp Pro Glu Val 
210 215 220 

Arg Gin Glu Ala Glu Arg Leu Leu Ala Val Thr Gly Ala Ala Phe Gly 
225 230 235 240 



Ala Thr Pro Gln Pro Thr Tyr Leu Asp Ser Pro Val Leu Asp Gly Leu
245 250 255



Ala Ala Leu Ala Ala Gin Gly Ala Pro Thr Ala 
260 265 



<210> 19 

<211> 777 

<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(774)

<400> 19 

ttg tta ctg gtt atc gat gtg ggg aac acc aat act gta ctt ggt gta 48
Leu Leu Leu Val Ile Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val
1 5 10 15

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca agc agg 96
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg Ile Glu Thr Ser Arg
20 25 30

cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tcc tta ttt gat 144
His Lys Thr Glu Asp Glu Phe Gly Met Ile Leu Arg Ser Leu Phe Asp
35 40 45

cac tcc ggg ctt atg ttt gaa cag ata gat ggc att att att tcg tca 192
His Ser Gly Leu Met Phe Glu Gln Ile Asp Gly Ile Ile Ile Ser Ser
50 55 60

gta gtg ccg cca atc atg ttt gcg tta gaa aga atg tgc aca aaa tac 240
Val Val Pro Pro Ile Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr
65 70 75 80

ttt cat atc gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 288
Phe His Ile Glu Pro Gln Ile Val Gly Pro Gly Met Lys Thr Gly Leu
85 90 95

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga atc gta 336
Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile Val
100 105 110

aat gct gtc gct gcg ata cac ttg tac ggc aat cca tta att gtt gtc 384
Asn Ala Val Ala Ala Ile His Leu Tyr Gly Asn Pro Leu Ile Val Val
115 120 125

gat ttc gga acc gcc aca acg tac tgc tat att gat gaa aac aaa caa 432
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr Ile Asp Glu Asn Lys Gln
130 135 140

tac atg ggc ggg gcg att gcc cct ggg att aca att tcg aca gag gcg 480
Tyr Met Gly Gly Ala Ile Ala Pro Gly Ile Thr Ile Ser Thr Glu Ala
145 150 155 160

ctt tac tcg cgt gca gca aag ctt cct cgt atc gaa atc acc cgg ccc 528
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg Ile Glu Ile Thr Arg Pro
165 170 175

gac aat att atc gga aaa aac act gtt agc gcg atg caa tct gga att 576
Asp Asn Ile Ile Gly Lys Asn Thr Val Ser Ala Met Gln Ser Gly Ile
180 185 190

tta ttt ggc tat gtc ggc caa gtg gaa gga atc gtt aag cga atg aaa 624
Leu Phe Gly Tyr Val Gly Gln Val Glu Gly Ile Val Lys Arg Met Lys
195 200 205

tgg cag gca aaa cag gac ctc aag gtc att gcg aca gga ggc ctg gcg 672
Trp Gln Ala Lys Gln Asp Leu Lys Val Ile Ala Thr Gly Gly Leu Ala
210 215 220

ccg ctc att gcg aac gaa tca gat tgt ata gac atc gtt gat cca ttc 720
Pro Leu Ile Ala Asn Glu Ser Asp Cys Ile Asp Ile Val Asp Pro Phe
225 230 235 240

tta acc cta aaa ggg ctg gaa ttg att tat gaa aga aac cgc gta gga 768
Leu Thr Leu Lys Gly Leu Glu Leu Ile Tyr Glu Arg Asn Arg Val Gly
245 250 255

agt gta tag 777
Ser Val



<210> 20 
<211> 960 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(957) 

<400> 20 

gtg aaa aat aaa gaa ctt aac cta cat act tta tat aca cag cac aat 48 
Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gln His Asn 
1 5 10 15 

cgg gag tct tgg tct ggt ttt ggg ggg cat ttg tcg att gct gta tct 96 
Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser Ile Ala Val Ser 
20 25 30 

gaa gaa gag gca aaa gct gtg gaa gga ttg aat gat tat cta tct gtt 144 
Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp Tyr Leu Ser Val 
35 40 45 

gaa gaa gtg gag acg atc tat att ccg ctt gtt cgc ttg ctt cat tta 192 
Glu Glu Val Glu Thr Ile Tyr Ile Pro Leu Val Arg Leu Leu His Leu 
50 55 60 

cat gtc aag tct gcg gct gaa cgc aat aag cat gtc aat gtt ttt ttg 240 
His Val Lys Ser Ala Ala Glu Arg Asn Lys His Val Asn Val Phe Leu 
65 70 75 80 

aag cac cca cat tca gcc aaa att ccg ttt att atc ggc att gcc ggc 288 
Lys His Pro His Ser Ala Lys Ile Pro Phe Ile Ile Gly Ile Ala Gly 
85 90 95 

agt gtc gca gtc gga aaa agc acg acg gcg cgg atc ttg cag aag ctg 336 
Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Ile Leu Gln Lys Leu 
100 105 110 

ctt tcg cgt ttg cct gac cgt cca aaa gtg agc ctt atc acg aca gat 384 
Leu Ser Arg Leu Pro Asp Arg Pro Lys Val Ser Leu Ile Thr Thr Asp 
115 120 125 

ggt ttt tta ttt cct act gcc gag ctg aaa aag aaa aat atg atg tca 432 
Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys Lys Lys Asn Met Met Ser 
130 135 140 

aga aaa gga ttt cct gaa agc tat gat gta aag gcg ctg ctc gaa ttt 480 
Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe 
145 150 155 160 

ttg aat gac tta aaa tca gga aag gac agc gta aag gcc ccg gtg tat 528 
Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr 
165 170 175 

tcc cat cta acc tat gac cgc gag gaa ggt gtg ttc gag gtt gta gaa 576 
Ser His Leu Thr Tyr Asp Arg Glu Glu Gly Val Phe Glu Val Val Glu 
180 185 190 

cag gcg gat att gtg att att gaa ggc att aat gtt ctt cag tcg ccc 624 
Gln Ala Asp Ile Val Ile Ile Glu Gly Ile Asn Val Leu Gln Ser Pro 
195 200 205 

acc ttg gag gat gac cgg gaa aac ccg cgt att ttt gtt tcc gat ttc 672 
Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg Ile Phe Val Ser Asp Phe 
210 215 220 

ttt gat ttt tcg att tat gtg gat gcg gag gaa agc cgg att ttc act 720 
Phe Asp Phe Ser Ile Tyr Val Asp Ala Glu Glu Ser Arg Ile Phe Thr 
225 230 235 240 

tgg tat tta gag cgt ttt cgc ctg ctt cgg gaa aca gct ttt caa aat 768 
Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe Gln Asn 
245 250 255 

cct gat tca tat ttt cat aaa ttt aaa gac ttg tcc gat cag gag gct 816 
Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp Gln Glu Ala 
260 265 270 

gac gag atg gca gcc tcg att tgg gag agt gtc aac cgg ccg aat tta 864 
Asp Glu Met Ala Ala Ser Ile Trp Glu Ser Val Asn Arg Pro Asn Leu 
275 280 285 

tat gaa aat att ttg cca act aaa ttc agg tca gat ctc att ttg cgt 912 
Tyr Glu Asn Ile Leu Pro Thr Lys Phe Arg Ser Asp Leu Ile Leu Arg 
290 295 300 

aag gga gac ggg cat aag gtc gag gaa gtg ttg gta agg agg gta tga 960 
Lys Gly Asp Gly His Lys Val Glu Glu Val Leu Val Arg Arg Val 
305 310 315 






<210> 21 
<211> 882 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(879) 

<400> 21 

ttg tcg att gct gta tct gaa gaa gag gca aaa gct gtg gaa gga ttg 48 
Met Ser Ile Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu 
1 5 10 15 

aat gat tat cta tct gtt gaa gaa gtg gag acg atc tat att ccg ctt 96 
Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr Ile Tyr Ile Pro Leu 
20 25 30 

gtt cgc ttg ctt cat tta cat gtc aag tct gcg gct gaa cgc aat aag 144 
Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys 
35 40 45 

cat gtc aat gtt ttt ttg aag cac cca cat tca gcc aaa att ccg ttt 192 
His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys Ile Pro Phe 
50 55 60 

att atc ggc att gcc ggc agt gtc gca gtc gga aaa agc acg acg gcg 240 
Ile Ile Gly Ile Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala 
65 70 75 80 

cgg atc ttg cag aag ctg ctt tcg cgt ttg cct gac cgt cca aaa gtg 288 
Arg Ile Leu Gln Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val 
85 90 95 

agc ctt atc acg aca gat ggt ttt tta ttt cct act gcc gag ctg aaa 336 
Ser Leu Ile Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys 
100 105 110 

aag aaa aat atg atg tca aga aaa gga ttt cct gaa agc tat gat gta 384 
Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val 
115 120 125 

aag gcg ctg ctc gaa ttt ttg aat gac tta aaa tca gga aag gac agc 432 
Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser 
130 135 140 

gta aag gcc ccg gtg tat tcc cat cta acc tat gac cgc gag gaa ggt 480 
Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly 
145 150 155 160 

gtg ttc gag gtt gta gaa cag gcg gat att gtg att att gaa ggc att 528 
Val Phe Glu Val Val Glu Gln Ala Asp Ile Val Ile Ile Glu Gly Ile 
165 170 175 

aat gtt ctt cag tcg ccc acc ttg gag gat gac cgg gaa aac ccg cgt 576 
Asn Val Leu Gln Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg 
180 185 190 

att ttt gtt tcc gat ttc ttt gat ttt tcg att tat gtg gat gcg gag 624 
Ile Phe Val Ser Asp Phe Phe Asp Phe Ser Ile Tyr Val Asp Ala Glu 
195 200 205 

gaa agc cgg att ttc act tgg tat tta gag cgt ttt cgc ctg ctt cgg 672 
Glu Ser Arg Ile Phe Thr Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg 
210 215 220 

gaa aca gct ttt caa aat cct gat tca tat ttt cat aaa ttt aaa gac 720 
Glu Thr Ala Phe Gln Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp 
225 230 235 240 

ttg tcc gat cag gag gct gac gag atg gca gcc tcg att tgg gag agt 768 
Leu Ser Asp Gln Glu Ala Asp Glu Met Ala Ala Ser Ile Trp Glu Ser 
245 250 255 

gtc aac cgg ccg aat tta tat gaa aat att ttg cca act aaa ttc agg 816 
Val Asn Arg Pro Asn Leu Tyr Glu Asn Ile Leu Pro Thr Lys Phe Arg 
260 265 270 

tca gat ctc att ttg cgt aag gga gac ggg cat aag gtc gag gaa gtg 864 
Ser Asp Leu Ile Leu Arg Lys Gly Asp Gly His Lys Val Glu Glu Val 
275 280 285 

ttg gta agg agg gta tga 882 
Leu Val Arg Arg Val 
290 



<210> 22 
<211> 846 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(843) 

<400> 22 

gtg gaa gga ttg aat gat tat cta tct gtt gaa gaa gtg gag acg atc 48 
Met Glu Gly Leu Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr Ile 
1 5 10 15 

tat att ccg ctt gtt cgc ttg ctt cat tta cat gtc aag tct gcg gct 96 
Tyr Ile Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala 
20 25 30 

gaa cgc aat aag cat gtc aat gtt ttt ttg aag cac cca cat tca gcc 144 
Glu Arg Asn Lys His Val Asn Val Phe Leu Lys His Pro His Ser Ala 
35 40 45 

aaa att ccg ttt att atc ggc att gcc ggc agt gtc gca gtc gga aaa 192 
Lys Ile Pro Phe Ile Ile Gly Ile Ala Gly Ser Val Ala Val Gly Lys 
50 55 60 

agc acg acg gcg cgg atc ttg cag aag ctg ctt tcg cgt ttg cct gac 240 
Ser Thr Thr Ala Arg Ile Leu Gln Lys Leu Leu Ser Arg Leu Pro Asp 
65 70 75 80 

cgt cca aaa gtg agc ctt atc acg aca gat ggt ttt tta ttt cct act 288 
Arg Pro Lys Val Ser Leu Ile Thr Thr Asp Gly Phe Leu Phe Pro Thr 
85 90 95 

gcc gag ctg aaa aag aaa aat atg atg tca aga aaa gga ttt cct gaa 336 
Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu 
100 105 110 

agc tat gat gta aag gcg ctg ctc gaa ttt ttg aat gac tta aaa tca 384 
Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser 
115 120 125 

gga aag gac agc gta aag gcc ccg gtg tat tcc cat cta acc tat gac 432 
Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp 
130 135 140 

cgc gag gaa ggt gtg ttc gag gtt gta gaa cag gcg gat att gtg att 480 
Arg Glu Glu Gly Val Phe Glu Val Val Glu Gln Ala Asp Ile Val Ile 
145 150 155 160 

att gaa ggc att aat gtt ctt cag tcg ccc acc ttg gag gat gac cgg 528 
Ile Glu Gly Ile Asn Val Leu Gln Ser Pro Thr Leu Glu Asp Asp Arg 
165 170 175 

gaa aac ccg cgt att ttt gtt tcc gat ttc ttt gat ttt tcg att tat 576 
Glu Asn Pro Arg Ile Phe Val Ser Asp Phe Phe Asp Phe Ser Ile Tyr 
180 185 190 

gtg gat gcg gag gaa agc cgg att ttc act tgg tat tta gag cgt ttt 624 
Val Asp Ala Glu Glu Ser Arg Ile Phe Thr Trp Tyr Leu Glu Arg Phe 
195 200 205 

cgc ctg ctt cgg gaa aca gct ttt caa aat cct gat tca tat ttt cat 672 
Arg Leu Leu Arg Glu Thr Ala Phe Gln Asn Pro Asp Ser Tyr Phe His 
210 215 220 

aaa ttt aaa gac ttg tcc gat cag gag gct gac gag atg gca gcc tcg 720 
Lys Phe Lys Asp Leu Ser Asp Gln Glu Ala Asp Glu Met Ala Ala Ser 
225 230 235 240 

att tgg gag agt gtc aac cgg ccg aat tta tat gaa aat att ttg cca 768 
Ile Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn Ile Leu Pro 
245 250 255 

act aaa ttc agg tca gat ctc att ttg cgt aag gga gac ggg cat aag 816 
Thr Lys Phe Arg Ser Asp Leu Ile Leu Arg Lys Gly Asp Gly His Lys 
260 265 270 

gtc gag gaa gtg ttg gta agg agg gta tga 846 
Val Glu Glu Val Leu Val Arg Arg Val 
275 280 



<210> 23 
<211> 831 
<212> DNA 
<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(831) 

<400> 23 

atg aaa aca aaa ctg gat ttt cta aaa atg aag gag tct gaa gaa ccg 48 
Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 
1 5 10 15 

att gtc atg ctg acc gct tat gat tat ccg gca gct aaa ctt gct gaa 96 
Ile Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 
20 25 30 

caa gcg gga gtt gac atg att tta gtc ggt gat tca ctt gga atg gtc 144 
Gln Ala Gly Val Asp Met Ile Leu Val Gly Asp Ser Leu Gly Met Val 
35 40 45 

gtc ctc ggc ctt gat tca act gtc ggt gtg aca gtt gcg gac atg atc 192 
Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met Ile 
50 55 60 

cat cat aca aaa gcc gtt aaa agg ggt gcg ccg aat acc ttt att gtg 240 
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe Ile Val 
65 70 75 80 

aca gat atg ccg ttt atg tct tat cac ctg tct aag gaa gat acg ctg 288 
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 
85 90 95 

aaa aat gca gcg gct atc gtt cag gaa agc gga gct gac gca ctg aag 336 
Lys Asn Ala Ala Ala Ile Val Gln Glu Ser Gly Ala Asp Ala Leu Lys 
100 105 110 

ctt gag ggc gga gaa ggc gtg ttt gaa tcc att cgc gca ttg acg ctt 384 
Leu Glu Gly Gly Glu Gly Val Phe Glu Ser Ile Arg Ala Leu Thr Leu 
115 120 125 

gga ggc att cca gta gtc agt cac tta ggt ttg aca ccg cag tca gtc 432 
Gly Gly Ile Pro Val Val Ser His Leu Gly Leu Thr Pro Gln Ser Val 
130 135 140 

ggc gta ctg ggc ggc tat aaa gta cag ggc aaa gac gaa caa agc gcc 480 
Gly Val Leu Gly Gly Tyr Lys Val Gln Gly Lys Asp Glu Gln Ser Ala 
145 150 155 160 

aaa aaa tta ata gaa gac agt ata aaa tgc gaa gaa gca gga gct atg 528 
Lys Lys Leu Ile Glu Asp Ser Ile Lys Cys Glu Glu Ala Gly Ala Met 
165 170 175 

atg ctt gtg ctg gaa tgt gtg ccg gca gaa ctc aca gcc aaa att gcc 576 
Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys Ile Ala 
180 185 190 

gag acg cta agc ata ccg gtc att gga atc ggg gct ggt gtg aaa gcg 624 
Glu Thr Leu Ser Ile Pro Val Ile Gly Ile Gly Ala Gly Val Lys Ala 
195 200 205 

gac gga caa gtt ctc gtt tat cat gat att atc ggc cac ggt gtt gag 672 
Asp Gly Gln Val Leu Val Tyr His Asp Ile Ile Gly His Gly Val Glu 
210 215 220 

aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa acc atc 720 
Arg Thr Pro Lys Phe Val Lys Gln Tyr Thr Arg Ile Asp Glu Thr Ile 
225 230 235 240 

gaa aca gca atc agc gga tat gtt cag gat gta aga cat cgt gct ttc 768 
Glu Thr Ala Ile Ser Gly Tyr Val Gln Asp Val Arg His Arg Ala Phe 
245 250 255 

cct gaa caa aag cat tcc ttt caa atg aac cag aca gtg ctt gac ggc 816 
Pro Glu Gln Lys His Ser Phe Gln Met Asn Gln Thr Val Leu Asp Gly 
260 265 270 

ttg tac ggg gga aaa 831 
Leu Tyr Gly Gly Lys 
275 



<210> 24 
<211> 277 
<212> PRT 

<213> Bacillus subtilis 
<400> 24 

Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 
1 5 10 15 

Ile Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 
20 25 30 

Gln Ala Gly Val Asp Met Ile Leu Val Gly Asp Ser Leu Gly Met Val 
35 40 45 

Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met Ile 
50 55 60 

His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe Ile Val 
65 70 75 80 

Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 
85 90 95 

Lys Asn Ala Ala Ala Ile Val Gln Glu Ser Gly Ala Asp Ala Leu Lys 
100 105 110 

Leu Glu Gly Gly Glu Gly Val Phe Glu Ser Ile Arg Ala Leu Thr Leu 
115 120 125 

Gly Gly Ile Pro Val Val Ser His Leu Gly Leu Thr Pro Gln Ser Val 
130 135 140 

Gly Val Leu Gly Gly Tyr Lys Val Gln Gly Lys Asp Glu Gln Ser Ala 
145 150 155 160 

Lys Lys Leu Ile Glu Asp Ser Ile Lys Cys Glu Glu Ala Gly Ala Met 
165 170 175 

Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys Ile Ala 
180 185 190 

Glu Thr Leu Ser Ile Pro Val Ile Gly Ile Gly Ala Gly Val Lys Ala 
195 200 205 

Asp Gly Gln Val Leu Val Tyr His Asp Ile Ile Gly His Gly Val Glu 
210 215 220 

Arg Thr Pro Lys Phe Val Lys Gln Tyr Thr Arg Ile Asp Glu Thr Ile 
225 230 235 240 

Glu Thr Ala Ile Ser Gly Tyr Val Gln Asp Val Arg His Arg Ala Phe 
245 250 255 

Pro Glu Gln Lys His Ser Phe Gln Met Asn Gln Thr Val Leu Asp Gly 
260 265 270 

Leu Tyr Gly Gly Lys 
275 



<210> 25 
<211> 858 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(858) 

<400> 25 

atg aga cag att act gat att tca cag ctg aaa gaa gcc ata aaa caa 48 
Met Arg Gln Ile Thr Asp Ile Ser Gln Leu Lys Glu Ala Ile Lys Gln 
1 5 10 15 

tac cat tca gag ggc aag tca atc gga ttt gtt ccg acg atg ggg ttt 96 
Tyr His Ser Glu Gly Lys Ser Ile Gly Phe Val Pro Thr Met Gly Phe 
20 25 30 

ctg cat gag ggg cat tta acc tta gca gac aaa gca aga caa gaa aac 144 
Leu His Glu Gly His Leu Thr Leu Ala Asp Lys Ala Arg Gln Glu Asn 
35 40 45 

gac gcc gtt att atg agt att ttt gtg aat cct gca caa ttc ggc cct 192 
Asp Ala Val Ile Met Ser Ile Phe Val Asn Pro Ala Gln Phe Gly Pro 
50 55 60 

aat gaa gat ttt gaa gca tat ccg cgc gat att gag cgg gat gca gct 240 
Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp Ile Glu Arg Asp Ala Ala 
65 70 75 80 

ctt gca gaa aac gcc gga gtc gat att ctt ttt acg cca gat gct cat 288 
Leu Ala Glu Asn Ala Gly Val Asp Ile Leu Phe Thr Pro Asp Ala His 
85 90 95 

gat atg tat ccc ggt gaa aag aat gtc acg att cat gta gaa aga cgc 336 
Asp Met Tyr Pro Gly Glu Lys Asn Val Thr Ile His Val Glu Arg Arg 
100 105 110 

aca gac gtg tta tgc ggg cgc tca aga gaa gga cat ttt gac ggg gtc 384 
Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val 
115 120 125 

gcg atc gta ctg acg aag ctt ttc aat cta gtc aag ccg act cgt gcc 432 
Ala Ile Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala 
130 135 140 

tat ttc ggt tta aaa gat gcg cag cag gta gct gtt gtt gat ggg tta 480 
Tyr Phe Gly Leu Lys Asp Ala Gln Gln Val Ala Val Val Asp Gly Leu 
145 150 155 160 

atc agc gac ttc ttc atg gat att gaa ttg gtt cct gtc gat acg gtc 528 
Ile Ser Asp Phe Phe Met Asp Ile Glu Leu Val Pro Val Asp Thr Val 
165 170 175 

aga gag gaa gac ggc tta gcc aaa agc tct cgc aat gta tac tta aca 576 
Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr 
180 185 190 

gct gag gaa aga aaa gaa gcg cct aag ctg tat cgg gcc ctt caa aca 624 
Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gln Thr 
195 200 205 

agt gcg gaa ctt gtc caa gcc ggt gaa aga gat cct gaa gcg gtg ata 672 
Ser Ala Glu Leu Val Gln Ala Gly Glu Arg Asp Pro Glu Ala Val Ile 
210 215 220 

aaa gct gca aaa gat atc att gaa acg act agc gga acc ata gac tat 720 
Lys Ala Ala Lys Asp Ile Ile Glu Thr Thr Ser Gly Thr Ile Asp Tyr 
225 230 235 240 

gta gag ctt tat tcc tat ccg gaa ctc gag cct gtg aat gaa att gct 768 
Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu Ile Ala 
245 250 255 

gga aag atg att ctc gct gtt gca gtt gct ttt tca aaa gcg cgt tta 816 
Gly Lys Met Ile Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu 
260 265 270 

ata gat aat atc att att gat att cga gaa atg gag aga ata 858 
Ile Asp Asn Ile Ile Ile Asp Ile Arg Glu Met Glu Arg Ile 
275 280 285 



<210> 26 
<211> 286 
<212> PRT 

<213> Bacillus subtilis 
<400> 26 

Met Arg Gln Ile Thr Asp Ile Ser Gln Leu Lys Glu Ala Ile Lys Gln 
1 5 10 15 

Tyr His Ser Glu Gly Lys Ser Ile Gly Phe Val Pro Thr Met Gly Phe 
20 25 30 

Leu His Glu Gly His Leu Thr Leu Ala Asp Lys Ala Arg Gln Glu Asn 
35 40 45 

Asp Ala Val Ile Met Ser Ile Phe Val Asn Pro Ala Gln Phe Gly Pro 
50 55 60 

Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp Ile Glu Arg Asp Ala Ala 
65 70 75 80 

Leu Ala Glu Asn Ala Gly Val Asp Ile Leu Phe Thr Pro Asp Ala His 
85 90 95 

Asp Met Tyr Pro Gly Glu Lys Asn Val Thr Ile His Val Glu Arg Arg 
100 105 110 

Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val 
115 120 125 

Ala Ile Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala 
130 135 140 

Tyr Phe Gly Leu Lys Asp Ala Gln Gln Val Ala Val Val Asp Gly Leu 
145 150 155 160 

Ile Ser Asp Phe Phe Met Asp Ile Glu Leu Val Pro Val Asp Thr Val 
165 170 175 

Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr 
180 185 190 

Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gln Thr 
195 200 205 

Ser Ala Glu Leu Val Gln Ala Gly Glu Arg Asp Pro Glu Ala Val Ile 
210 215 220 

Lys Ala Ala Lys Asp Ile Ile Glu Thr Thr Ser Gly Thr Ile Asp Tyr 
225 230 235 240 

Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu Ile Ala 
245 250 255 

Gly Lys Met Ile Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu 
260 265 270 

Ile Asp Asn Ile Ile Ile Asp Ile Arg Glu Met Glu Arg Ile 
275 280 285 



<210> 27 
<211> 381 
<212> DNA 

<213> Bacillus subtilis 
<220> 

<221> CDS 

<222> (1)..(381) 

<400> 27 

atg tat cga aca atg atg agc ggc aaa ctt cac agg gca act gtt acg 48 
Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr 
1 5 10 15 

gaa gca aac ctg aac tat gtg gga agc att aca att gat gaa gat ctc 96 
Glu Ala Asn Leu Asn Tyr Val Gly Ser Ile Thr Ile Asp Glu Asp Leu 
20 25 30 

att gat gct gtg gga atg ctt cct aat gaa aaa gta caa att gtg aat 144 
Ile Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gln Ile Val Asn 
35 40 45 

aat aat aat gga gca cgt ctt gaa acg tat att att cct ggt aaa cgg 192 
Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr Ile Ile Pro Gly Lys Arg 
50 55 60 

gga agc ggc gtc ata tgc tta aac ggt gca gcc gca cgc ctt gtg cag 240 
Gly Ser Gly Val Ile Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gln 
65 70 75 80 

gaa gga gat aag gtc att att att tcc tac aaa atg atg tct gat caa 288 
Glu Gly Asp Lys Val Ile Ile Ile Ser Tyr Lys Met Met Ser Asp Gln 
85 90 95 

gaa gcg gca agc cat gag ccg aaa gtg gct gtt ctg aat gat caa aac 336 
Glu Ala Ala Ser His Glu Pro Lys Val Ala Val Leu Asn Asp Gln Asn 
100 105 110 

aaa att gaa caa atg ctg ggg aac gaa cca gcc cgt aca att ttg 381 
Lys Ile Glu Gln Met Leu Gly Asn Glu Pro Ala Arg Thr Ile Leu 
115 120 125 



<210> 28 
<211> 127 
<212> PRT 

<213> Bacillus subtilis 
<400> 28 

Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr 
1 5 10 15 

Glu Ala Asn Leu Asn Tyr Val Gly Ser Ile Thr Ile Asp Glu Asp Leu 
20 25 30 

Ile Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gln Ile Val Asn 
35 40 45 

Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr Ile Ile Pro Gly Lys Arg 
50 55 60 

Gly Ser Gly Val Ile Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gln 
65 70 75 80 

Glu Gly Asp Lys Val Ile Ile Ile Ser Tyr Lys Met Met Ser Asp Gln 
85 90 95 

Glu Ala Ala Ser His Glu Pro Lys Val Ala Val Leu Asn Asp Gln Asn 
100 105 110 

Lys Ile Glu Gln Met Leu Gly Asn Glu Pro Ala Arg Thr Ile Leu 
115 120 125 



<210> 29 
<211> 894 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(894) 

<400> 29 

atg aaa att gga att atc ggc gga ggc tcc gtt ggt ctt tta tgc gcc 48 
Met Lys Ile Gly Ile Ile Gly Gly Gly Ser Val Gly Leu Leu Cys Ala 
1 5 10 15 

tat tat ttg tca ctt tat cac gac gtg act gtt gtg acg agg cgg caa 96 
Tyr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg Gln 
20 25 30 

gaa cag gct gcg gcc att cag tct gaa gga atc cgg ctt tat aaa ggc 144 
Glu Gln Ala Ala Ala Ile Gln Ser Glu Gly Ile Arg Leu Tyr Lys Gly 
35 40 45 

ggg gag gaa ttc agg gct gat tgc agt gcg gac acg agt atc aat tcg 192 
Gly Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser Ile Asn Ser 
50 55 60 

gac ttt gac ctg ctt gtc gtg aca gtg aag cag cat cag ctt caa tct 240 
Asp Phe Asp Leu Leu Val Val Thr Val Lys Gln His Gln Leu Gln Ser 
65 70 75 80 

gtt ttt tcg tcg ctt gaa cga atc ggg aag acg aat ata tta ttt ttg 288 
Val Phe Ser Ser Leu Glu Arg Ile Gly Lys Thr Asn Ile Leu Phe Leu 
85 90 95 

caa aac ggc atg ggg cat atc cac gac cta aaa gac tgg cac gtt ggc 336 
Gln Asn Gly Met Gly His Ile His Asp Leu Lys Asp Trp His Val Gly 
100 105 110 

cat tcc att tat gtt gga atc gtt gag cac gga gct gta aga aaa tcg 384 
His Ser Ile Tyr Val Gly Ile Val Glu His Gly Ala Val Arg Lys Ser 
115 120 125 

gat aca gct gtt gat cat aca ggc cta ggt gcg ata aaa tgg agc gcg 432 
Asp Thr Ala Val Asp His Thr Gly Leu Gly Ala Ile Lys Trp Ser Ala 
130 135 140 

ttc gac gat gct gaa cca gac cgg ctg aac atc ttg ttt cag cat aac 480 
Phe Asp Asp Ala Glu Pro Asp Arg Leu Asn Ile Leu Phe Gln His Asn 
145 150 155 160 

cat tcg gat ttt ccg att tat tat gag acg gat tgg tac cgt ctg ctg 528 
His Ser Asp Phe Pro Ile Tyr Tyr Glu Thr Asp Trp Tyr Arg Leu Leu 
165 170 175 

acg ggc aag ctg att gta aat gcg tgt att aat cct tta act gcg tta 576 
Thr Gly Lys Leu Ile Val Asn Ala Cys Ile Asn Pro Leu Thr Ala Leu 
180 185 190 

ttg caa gtg aaa aat gga gaa ctg ctg aca acg cca gct tat ctg gct 624 
Leu Gln Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala 
195 200 205 

ttt atg aag ctg gta ttt cag gag gca tgc cgc att tta aaa ctt gaa 672 
Phe Met Lys Leu Val Phe Gln Glu Ala Cys Arg Ile Leu Lys Leu Glu 
210 215 220 

aat gaa gaa aag gct tgg gag cgg gtt cag gcc gtt tgt ggg caa acg 720 
Asn Glu Glu Lys Ala Trp Glu Arg Val Gln Ala Val Cys Gly Gln Thr 
225 230 235 240 

aaa gag aat cgt tca tca atg ctg gtt gac gtc att gga ggc cgg cag 768 
Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val Ile Gly Gly Arg Gln 
245 250 255 

acg gaa gct gac gcc att atc gga tac tta ttg aag gaa gca agt ctt 816 
Thr Glu Ala Asp Ala Ile Ile Gly Tyr Leu Leu Lys Glu Ala Ser Leu 
260 265 270 

caa ggt ctt gat gcc gtc cac cta gag ttt tta tat ggc agc atc aaa 864 
Gln Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr Gly Ser Ile Lys 
275 280 285 

gca ttg gag cga aat aca aac aaa gtc ttt 894 
Ala Leu Glu Arg Asn Thr Asn Lys Val Phe 
290 295 



<210> 30 
<211> 298 
<212> PRT 

<213> Bacillus subtilis 
<400> 30 

Met Lys Ile Gly Ile Ile Gly Gly Gly Ser Val Gly Leu Leu Cys Ala 
1 5 10 15 

Tyr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg Gln 
20 25 30 

Glu Gln Ala Ala Ala Ile Gln Ser Glu Gly Ile Arg Leu Tyr Lys Gly 
35 40 45 

Gly Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser Ile Asn Ser 
50 55 60 

Asp Phe Asp Leu Leu Val Val Thr Val Lys Gln His Gln Leu Gln Ser 
65 70 75 80 

Val Phe Ser Ser Leu Glu Arg Ile Gly Lys Thr Asn Ile Leu Phe Leu 
85 90 95 

Gln Asn Gly Met Gly His Ile His Asp Leu Lys Asp Trp His Val Gly 
100 105 110 

His Ser Ile Tyr Val Gly Ile Val Glu His Gly Ala Val Arg Lys Ser 
115 120 125 

Asp Thr Ala Val Asp His Thr Gly Leu Gly Ala Ile Lys Trp Ser Ala 
130 135 140 

Phe Asp Asp Ala Glu Pro Asp Arg Leu Asn Ile Leu Phe Gln His Asn 
145 150 155 160 

His Ser Asp Phe Pro Ile Tyr Tyr Glu Thr Asp Trp Tyr Arg Leu Leu 
165 170 175 

Thr Gly Lys Leu Ile Val Asn Ala Cys Ile Asn Pro Leu Thr Ala Leu 
180 185 190 

Leu Gln Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala 
195 200 205 

Phe Met Lys Leu Val Phe Gln Glu Ala Cys Arg Ile Leu Lys Leu Glu 
210 215 220 

Asn Glu Glu Lys Ala Trp Glu Arg Val Gln Ala Val Cys Gly Gln Thr 
225 230 235 240 

Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val Ile Gly Gly Arg Gln 
245 250 255 

Thr Glu Ala Asp Ala Ile Ile Gly Tyr Leu Leu Lys Glu Ala Ser Leu 
260 265 270 

Gln Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr Gly Ser Ile Lys 
275 280 285 

Ala Leu Glu Arg Asn Thr Asn Lys Val Phe 
290 295 



<210> 31 
<211> 1725 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(1722) 

<400> 31 

atg ggg act aat gta cag gtg gat tca gca tct gcc gaa tgt aca cag 48 
Met Gly Thr Asn Val Gln Val Asp Ser Ala Ser Ala Glu Cys Thr Gln 
1 5 10 15 

acg atg agc gga gca tta atg ctg att gaa tca tta aaa aaa gag aaa 96 
Thr Met Ser Gly Ala Leu Met Leu Ile Glu Ser Leu Lys Lys Glu Lys 
20 25 30 

gta gaa atg atc ttc ggt tat ccg ggc ggg gct gtg ctt ccg att tac 144 
Val Glu Met Ile Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro Ile Tyr 
35 40 45 

gat aag cta tac aat tca ggg ttg gta cat atc ctt ccc cgt cac gaa 192 
Asp Lys Leu Tyr Asn Ser Gly Leu Val His Ile Leu Pro Arg His Glu 
50 55 60 

caa gga gca att cat gca gcg gag gga tac gca agg gtc tcc gga aaa 240 
Gln Gly Ala Ile His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys 
65 70 75 80 

ccg ggt gtc gtc att gcc acg tca ggg ccg gga gcg aca aac ctt gtt 288 
Pro Gly Val Val Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 
85 90 95 

aca ggc ctt gct gat gcc atg att gat tca ttg ccg tta gtc gtc ttt 336 
Thr Gly Leu Ala Asp Ala Met Ile Asp Ser Leu Pro Leu Val Val Phe 
100 105 110 

aca ggg cag gta gca acc tct gta atc ggg agc gat gca ttt cag gaa 384 
Thr Gly Gln Val Ala Thr Ser Val Ile Gly Ser Asp Ala Phe Gln Glu 
115 120 125 

gca gac att tta ggg att acg atg cca gta aca aaa cac agc tac cag 432 
Ala Asp Ile Leu Gly Ile Thr Met Pro Val Thr Lys His Ser Tyr Gln 
130 135 140 

gtt cgc cag ccg gaa gat ctg ccg cgc atc att aaa gaa gcg ttc cat 480 
Val Arg Gln Pro Glu Asp Leu Pro Arg Ile Ile Lys Glu Ala Phe His 
145 150 155 160 

att gca aca act gga aga ccc gga cct gta ttg att gat att ccg aaa 528 
Ile Ala Thr Thr Gly Arg Pro Gly Pro Val Leu Ile Asp Ile Pro Lys 
165 170 175 

gat gta gca aca att gaa gga gaa ttc agc tac gat cat gag atg aat 576 
Asp Val Ala Thr Ile Glu Gly Glu Phe Ser Tyr Asp His Glu Met Asn 
180 185 190 

ctc ccg gga tac cag ccg aca aca gag ccg aat tat ttg cag atc cgc 624 
Leu Pro Gly Tyr Gln Pro Thr Thr Glu Pro Asn Tyr Leu Gln Ile Arg 
195 200 205 

aag ctt gtg gaa gcc gtg agc agt gcg aaa aaa ccg gtg atc ctg gcg 672 
Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val Ile Leu Ala 
210 215 220 

ggt gcg ggc gta ctg cac gga aaa gcg tca gaa gaa tta aaa aat tat 720 
Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr 
225 230 235 240 

gct gaa cag cag caa atc cct gtg gca cac acc ctt ttg ggg ctc gga 768 
Ala Glu Gln Gln Gln Ile Pro Val Ala His Thr Leu Leu Gly Leu Gly 
245 250 255 

ggc ttc ccg gct gac cat ccg ctt ttc cta ggg atg gcg gga atg cac 816 
Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Met Ala Gly Met His 
260 265 270 

ggt act tat aca gcc aat atg gcc ctt cat gaa tgt gat cta tta atc 864 
Gly Thr Tyr Thr Ala Asn Met Ala Leu His Glu Cys Asp Leu Leu Ile 
275 280 285 

agt atc ggc gcc cgt ttt gat gac cgt gtc aca gga aac ctg aaa cac 912 
Ser Ile Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His 
290 295 300 

ttt gcc aga aac gca aag ata gcc cac atc gat att gat cca gct gaa 960 
Phe Ala Arg Asn Ala Lys Ile Ala His Ile Asp Ile Asp Pro Ala Glu 
305 310 315 320 

atc gga aaa atc atg aaa aca cag att cct gta gtc gga gac agc aaa 1008 
Ile Gly Lys Ile Met Lys Thr Gln Ile Pro Val Val Gly Asp Ser Lys 
325 330 335 

att gtc ctg cag gag ctg atc aaa caa gac ggc aaa caa agc gat tca 1056 
Ile Val Leu Gln Glu Leu Ile Lys Gln Asp Gly Lys Gln Ser Asp Ser 
340 345 350 

agc gaa tgg aaa aaa cag ctc gca gaa tgg aaa gaa gag tat ccg ctc 1104 
Ser Glu Trp Lys Lys Gln Leu Ala Glu Trp Lys Glu Glu Tyr Pro Leu 
355 360 365 

tgg tat gta gat aat gaa gaa gaa ggt ttt aaa cct cag aaa ttg att 1152 
Trp Tyr Val Asp Asn Glu Glu Glu Gly Phe Lys Pro Gln Lys Leu Ile 
370 375 380 

gaa tat att cat caa ttt aca aaa gga gag gcc att gtc gca acg gat 1200 
Glu Tyr Ile His Gln Phe Thr Lys Gly Glu Ala Ile Val Ala Thr Asp 
385 390 395 400 

gta ggc cag cat caa atg tgg tca gcg caa ttt tat ccg ttc caa aaa 1248 
Val Gly Gln His Gln Met Trp Ser Ala Gln Phe Tyr Pro Phe Gln Lys 
405 410 415 

gca gat aaa tgg gtc acg tca ggc gga ctt gga acg atg gga ttc ggt 1296 
Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Met Gly Phe Gly 
420 425 430 

ctt ccg gcg gcg atc ggc gca cag ctg gcc gaa aaa gat gct act gtt 1344 
Leu Pro Ala Ala Ile Gly Ala Gln Leu Ala Glu Lys Asp Ala Thr Val 
435 440 445 

gtc gcg gtt gtc gga gac ggc gga ttc caa atg acg ctt caa gaa ctc 1392 
Val Ala Val Val Gly Asp Gly Gly Phe Gln Met Thr Leu Gln Glu Leu 
450 455 460 

gat gtt att cgc gaa tta aat ctt ccg gtc aag gta gtg att tta aat 1440 
Asp Val Ile Arg Glu Leu Asn Leu Pro Val Lys Val Val Ile Leu Asn 
465 470 475 480 

aac gct tgt ctc gga atg gtc aga cag tgg cag gaa att ttc tat gaa 1488 
Asn Ala Cys Leu Gly Met Val Arg Gln Trp Gln Glu Ile Phe Tyr Glu 
485 490 495 

gaa cgt tat tca gaa tct aaa ttc gct tct cag cct gac ttc gtc aaa 1536 
Glu Arg Tyr Ser Glu Ser Lys Phe Ala Ser Gln Pro Asp Phe Val Lys 
500 505 510 

ttg tcc gaa gca tac ggc att aaa ggc atc aga att tca tca gaa gcg 1584 
Leu Ser Glu Ala Tyr Gly Ile Lys Gly Ile Arg Ile Ser Ser Glu Ala 
515 520 525 

gaa gca aag gaa aag ctg gaa gag gca tta aca tca aga gaa cct gtt 1632 
Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val 
530 535 540 

gtc att gac gtg cgg gtt gcc agc gaa gaa aaa gta ttc ccg atg gtg 1680 
Val Ile Asp Val Arg Val Ala Ser Glu Glu Lys Val Phe Pro Met Val 
545 550 555 560 

gct ccg ggg aaa ggg ctg cat gaa atg gtg ggg gtg aaa cct tga 1725 
Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro 
565 570 



<210> 32 
<211> 574 
<212> PRT 

<213> Bacillus subtilis 
<400> 32 

Met Gly Thr Asn Val Gln Val Asp Ser Ala Ser Ala Glu Cys Thr Gln 
1 5 10 15 

Thr Met Ser Gly Ala Leu Met Leu Ile Glu Ser Leu Lys Lys Glu Lys 
20 25 30 

Val Glu Met Ile Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro Ile Tyr 
35 40 45 

Asp Lys Leu Tyr Asn Ser Gly Leu Val His Ile Leu Pro Arg His Glu 
50 55 60 

Gln Gly Ala Ile His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys 
65 70 75 80 

Pro Gly Val Val Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 
85 90 95 

Thr Gly Leu Ala Asp Ala Met Ile Asp Ser Leu Pro Leu Val Val Phe 
100 105 110 

Thr Gly Gln Val Ala Thr Ser Val Ile Gly Ser Asp Ala Phe Gln Glu 
115 120 125 

Ala Asp Ile Leu Gly Ile Thr Met Pro Val Thr Lys His Ser Tyr Gln 
130 135 140 

Val Arg Gln Pro Glu Asp Leu Pro Arg Ile Ile Lys Glu Ala Phe His 
145 150 155 160 

Ile Ala Thr Thr Gly Arg Pro Gly Pro Val Leu Ile Asp Ile Pro Lys 
165 170 175 

Asp Val Ala Thr Ile Glu Gly Glu Phe Ser Tyr Asp His Glu Met Asn 
180 185 190 

Leu Pro Gly Tyr Gln Pro Thr Thr Glu Pro Asn Tyr Leu Gln Ile Arg 
195 200 205 

Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val Ile Leu Ala 
210 215 220 

Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr 
225 230 235 240 

Ala Glu Gln Gln Gln Ile Pro Val Ala His Thr Leu Leu Gly Leu Gly 
245 250 255 

Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Met Ala Gly Met His 
260 265 270 

Gly Thr Tyr Thr Ala Asn Met Ala Leu His Glu Cys Asp Leu Leu Ile 
275 280 285 

Ser Ile Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His 
290 295 300 

Phe Ala Arg Asn Ala Lys Ile Ala His Ile Asp Ile Asp Pro Ala Glu 
305 310 315 320 

Ile Gly Lys Ile Met Lys Thr Gln Ile Pro Val Val Gly Asp Ser Lys 
325 330 335 

Ile Val Leu Gln Glu Leu Ile Lys Gln Asp Gly Lys Gln Ser Asp Ser 
340 345 350 

Ser Glu Trp Lys Lys Gln Leu Ala Glu Trp Lys Glu Glu Tyr Pro Leu 
355 360 365 

Trp Tyr Val Asp Asn Glu Glu Glu Gly Phe Lys Pro Gln Lys Leu Ile 
370 375 380 

Glu Tyr Ile His Gln Phe Thr Lys Gly Glu Ala Ile Val Ala Thr Asp 
385 390 395 400 

Val Gly Gln His Gln Met Trp Ser Ala Gln Phe Tyr Pro Phe Gln Lys 
405 410 415 

Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Met Gly Phe Gly 
420 425 430 

Leu Pro Ala Ala Ile Gly Ala Gln Leu Ala Glu Lys Asp Ala Thr Val 
435 440 445 

Val Ala Val Val Gly Asp Gly Gly Phe Gln Met Thr Leu Gln Glu Leu 
450 455 460 

Asp Val Ile Arg Glu Leu Asn Leu Pro Val Lys Val Val Ile Leu Asn 
465 470 475 480 

Asn Ala Cys Leu Gly Met Val Arg Gln Trp Gln Glu Ile Phe Tyr Glu 
485 490 495 

Glu Arg Tyr Ser Glu Ser Lys Phe Ala Ser Gln Pro Asp Phe Val Lys 
500 505 510 

Leu Ser Glu Ala Tyr Gly Ile Lys Gly Ile Arg Ile Ser Ser Glu Ala 
515 520 525 

Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val 
530 535 540 

Val Ile Asp Val Arg Val Ala Ser Glu Glu Lys Val Phe Pro Met Val 
545 550 555 560 

Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro 
565 570 



<210> 33 

<211> 525 

<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(522)

<400> 33 

ttg aaa aga att atc aca ttg act gtg gtg aac cgc tcc ggg gtg tta 48
Met Lys Arg Ile Ile Thr Leu Thr Val Val Asn Arg Ser Gly Val Leu
1 5 10 15

aac cgg atc acc ggt cta ttc aca aaa agg cat tac aac att gaa agc 96
Asn Arg Ile Thr Gly Leu Phe Thr Lys Arg His Tyr Asn Ile Glu Ser
20 25 30

att aca gtt gga cac aca gaa aca gcc ggc gtt tcc aga atc acc ttc 144
Ile Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg Ile Thr Phe
35 40 45

gtc gtt cat gtt gaa ggt gaa aat gat gtt gaa cag tta acg aaa cag 192
Val Val His Val Glu Gly Glu Asn Asp Val Glu Gln Leu Thr Lys Gln
50 55 60

ctc aac aaa cag att gat gtg ctg aaa gtc aca gac atc aca aat caa 240
Leu Asn Lys Gln Ile Asp Val Leu Lys Val Thr Asp Ile Thr Asn Gln
65 70 75 80

tcg att gtc cag agg gag ctg gcc tta atc aag gtt gtc tcc gca cct 288
Ser Ile Val Gln Arg Glu Leu Ala Leu Ile Lys Val Val Ser Ala Pro
85 90 95

tca aca aga aca gag att aat gga atc ata gaa ccg ttt aga gcc tct 336
Ser Thr Arg Thr Glu Ile Asn Gly Ile Ile Glu Pro Phe Arg Ala Ser
100 105 110

gtc gtt gat gtc agc aga gac agc atc gtt gtt cag gtg aca ggt gaa 384
Val Val Asp Val Ser Arg Asp Ser Ile Val Val Gln Val Thr Gly Glu
115 120 125

tct aac aaa att gaa gcg ctt att gag tta tta aaa cct tat ggc att 432
Ser Asn Lys Ile Glu Ala Leu Ile Glu Leu Leu Lys Pro Tyr Gly Ile
130 135 140

aaa gaa atc gcg aga aca ggt aca acg gct ttt gcg agg gga acc agc 480
Lys Glu Ile Ala Arg Thr Gly Thr Thr Ala Phe Ala Arg Gly Thr Ser
145 150 155 160

aaa agg cgt cat cca ata aaa caa tat cta ttg tat aaa aca taa 525
Lys Arg Arg His Pro Ile Lys Gln Tyr Leu Leu Tyr Lys Thr
165 170



<210> 34 
<211> 174 
<212> PRT 

<213> Bacillus subtilis 
<400> 34 

Met Lys Arg Ile Ile Thr Leu Thr Val Val Asn Arg Ser Gly Val Leu
1 5 10 15

Asn Arg Ile Thr Gly Leu Phe Thr Lys Arg His Tyr Asn Ile Glu Ser
20 25 30

Ile Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg Ile Thr Phe
35 40 45

Val Val His Val Glu Gly Glu Asn Asp Val Glu Gln Leu Thr Lys Gln
50 55 60

Leu Asn Lys Gln Ile Asp Val Leu Lys Val Thr Asp Ile Thr Asn Gln
65 70 75 80

Ser Ile Val Gln Arg Glu Leu Ala Leu Ile Lys Val Val Ser Ala Pro
85 90 95

Ser Thr Arg Thr Glu Ile Asn Gly Ile Ile Glu Pro Phe Arg Ala Ser
100 105 110

Val Val Asp Val Ser Arg Asp Ser Ile Val Val Gln Val Thr Gly Glu
115 120 125

Ser Asn Lys Ile Glu Ala Leu Ile Glu Leu Leu Lys Pro Tyr Gly Ile
130 135 140

Lys Glu Ile Ala Arg Thr Gly Thr Thr Ala Phe Ala Arg Gly Thr Ser
145 150 155 160

Lys Arg Arg His Pro Ile Lys Gln Tyr Leu Leu Tyr Lys Thr
165 170



<210> 35 
<211> 1029 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(1026)

<400> 35 

atg gta aaa gta tat tat aac ggt gat atc aaa gag aac gta ttg gct 48
Met Val Lys Val Tyr Tyr Asn Gly Asp Ile Lys Glu Asn Val Leu Ala
1 5 10 15

gga aaa aca gta gcg gtt atc ggg tac ggt tcg caa ggc cac gca cat 96
Gly Lys Thr Val Ala Val Ile Gly Tyr Gly Ser Gln Gly His Ala His
20 25 30

gcc ctg aac ctt aaa gaa agc gga gta gac gtg atc gtc ggt gtt aga 144
Ala Leu Asn Leu Lys Glu Ser Gly Val Asp Val Ile Val Gly Val Arg
35 40 45

caa gga aaa tct ttc act caa gcc caa gaa gac gga cat aaa gta ttt 192
Gln Gly Lys Ser Phe Thr Gln Ala Gln Glu Asp Gly His Lys Val Phe
50 55 60

tca gta aaa gaa gcg gca gcc caa gcc gaa atc atc atg gtt ctg ctt 240
Ser Val Lys Glu Ala Ala Ala Gln Ala Glu Ile Ile Met Val Leu Leu
65 70 75 80

ccg gat gag cag cag caa aaa gta tac gaa gct gaa atc aaa gat gaa 288
Pro Asp Glu Gln Gln Gln Lys Val Tyr Glu Ala Glu Ile Lys Asp Glu
85 90 95

ttg aca gca gga aaa tca tta gta ttc gct cat gga ttt aac gtg cat 336
Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His
100 105 110

ttc cat caa att gtt cct ccg gcg gat gta gat gta ttc tta gtg gcc 384
Phe His Gln Ile Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala
115 120 125

cct aaa ggc ccg gga cac ttg gta aga aga aca tat gag caa gga gct 432
Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr Tyr Glu Gln Gly Ala
130 135 140

ggc gta cct gca ttg ttc gca atc tat caa gat gtg act gga gaa gca 480
Gly Val Pro Ala Leu Phe Ala Ile Tyr Gln Asp Val Thr Gly Glu Ala
145 150 155 160

aga gac aaa gcc ctc gct tat gct aaa gga atc ggc ggc gca aga gcg 528
Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly Ile Gly Gly Ala Arg Ala
165 170 175

ggc gta tta gaa acg aca ttt aaa gaa gaa aca gaa aca gat ttg ttc 576
Gly Val Leu Glu Thr Thr Phe Lys Glu Glu Thr Glu Thr Asp Leu Phe
180 185 190

ggt gag caa gca gtt ctt tgc ggc gga tta agc gcg ctt gtc aaa gcc 624
Gly Glu Gln Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala
195 200 205

gga ttt gaa acc tta act gaa gca ggt tat cag cct gaa ctt gca tac 672
Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Gln Pro Glu Leu Ala Tyr
210 215 220

ttc gag tgt ctt cat gag ctg aaa tta atc gta gac ctt atg tac gaa 720
Phe Glu Cys Leu His Glu Leu Lys Leu Ile Val Asp Leu Met Tyr Glu
225 230 235 240

gaa gga ctt gca gga atg aga tat tca atc tct gac aca gca cag tgg 768
Glu Gly Leu Ala Gly Met Arg Tyr Ser Ile Ser Asp Thr Ala Gln Trp
245 250 255

gga gat ttc gta tca ggc cct cgc gtt gtg gac gcc aaa gta aaa gaa 816
Gly Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu
260 265 270

tct atg aaa gaa gta tta aaa gat atc caa aac ggt aca ttc gca aaa 864
Ser Met Lys Glu Val Leu Lys Asp Ile Gln Asn Gly Thr Phe Ala Lys
275 280 285

gag tgg atc gtc gaa aac caa gta aac cgt cct cgt ttc aac gct atc 912
Glu Trp Ile Val Glu Asn Gln Val Asn Arg Pro Arg Phe Asn Ala Ile
290 295 300

aat gca agc gag aac gaa cat caa atc gaa gta gtg gga aga aag ctt 960
Asn Ala Ser Glu Asn Glu His Gln Ile Glu Val Val Gly Arg Lys Leu
305 310 315 320

cgt gaa atg atg ccg ttt gtg aaa caa ggc aag aag aag gaa gcg gtg 1008
Arg Glu Met Met Pro Phe Val Lys Gln Gly Lys Lys Lys Glu Ala Val
325 330 335

gtc tcc gtt gcg caa aat taa 1029
Val Ser Val Ala Gln Asn
340



<210> 36 
<211> 342 
<212> PRT 

<213> Bacillus subtilis 
<400> 36 

Met Val Lys Val Tyr Tyr Asn Gly Asp Ile Lys Glu Asn Val Leu Ala
1 5 10 15

Gly Lys Thr Val Ala Val Ile Gly Tyr Gly Ser Gln Gly His Ala His
20 25 30

Ala Leu Asn Leu Lys Glu Ser Gly Val Asp Val Ile Val Gly Val Arg
35 40 45

Gln Gly Lys Ser Phe Thr Gln Ala Gln Glu Asp Gly His Lys Val Phe
50 55 60

Ser Val Lys Glu Ala Ala Ala Gln Ala Glu Ile Ile Met Val Leu Leu
65 70 75 80

Pro Asp Glu Gln Gln Gln Lys Val Tyr Glu Ala Glu Ile Lys Asp Glu
85 90 95

Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His
100 105 110

Phe His Gln Ile Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala
115 120 125

Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr Tyr Glu Gln Gly Ala
130 135 140

Gly Val Pro Ala Leu Phe Ala Ile Tyr Gln Asp Val Thr Gly Glu Ala
145 150 155 160

Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly Ile Gly Gly Ala Arg Ala
165 170 175

Gly Val Leu Glu Thr Thr Phe Lys Glu Glu Thr Glu Thr Asp Leu Phe
180 185 190

Gly Glu Gln Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala
195 200 205

Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Gln Pro Glu Leu Ala Tyr
210 215 220

Phe Glu Cys Leu His Glu Leu Lys Leu Ile Val Asp Leu Met Tyr Glu
225 230 235 240

Glu Gly Leu Ala Gly Met Arg Tyr Ser Ile Ser Asp Thr Ala Gln Trp
245 250 255

Gly Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu
260 265 270

Ser Met Lys Glu Val Leu Lys Asp Ile Gln Asn Gly Thr Phe Ala Lys
275 280 285

Glu Trp Ile Val Glu Asn Gln Val Asn Arg Pro Arg Phe Asn Ala Ile
290 295 300

Asn Ala Ser Glu Asn Glu His Gln Ile Glu Val Val Gly Arg Lys Leu
305 310 315 320

Arg Glu Met Met Pro Phe Val Lys Gln Gly Lys Lys Lys Glu Ala Val
325 330 335

Val Ser Val Ala Gln Asn
340



<210> 37 

<211> 1674 

<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(1674)

<400> 37 

atg gca gaa tta cgc agt aat atg atc aca caa gga atc gat aga gct 48
Met Ala Glu Leu Arg Ser Asn Met Ile Thr Gln Gly Ile Asp Arg Ala
1 5 10 15

ccg cac cgc agt ttg ctt cgt gca gca ggg gta aaa gaa gag gat ttc 96
Pro His Arg Ser Leu Leu Arg Ala Ala Gly Val Lys Glu Glu Asp Phe
20 25 30

ggc aag ccg ttt att gcg gtg tgt aat tca tac att gat atc gtt ccc 144
Gly Lys Pro Phe Ile Ala Val Cys Asn Ser Tyr Ile Asp Ile Val Pro
35 40 45

ggt cat gtt cac ttg cag gag ttt ggg aaa atc gta aaa gaa gca atc 192
Gly His Val His Leu Gln Glu Phe Gly Lys Ile Val Lys Glu Ala Ile
50 55 60

aga gaa gca ggg ggc gtt ccg ttt gaa ttt aat acc att ggg gta gat 240
Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr Ile Gly Val Asp
65 70 75 80

gat ggc atc gca atg ggg cat atc ggt atg aga tat tcg ctg cca agc 288
Asp Gly Ile Ala Met Gly His Ile Gly Met Arg Tyr Ser Leu Pro Ser
85 90 95

cgt gaa att atc gca gac tct gtg gaa acg gtt gta tcc gca cac tgg 336
Arg Glu Ile Ile Ala Asp Ser Val Glu Thr Val Val Ser Ala His Trp
100 105 110

ttt gac gga atg gtc tgt att ccg aac tgc gac aaa atc aca ccg gga 384
Phe Asp Gly Met Val Cys Ile Pro Asn Cys Asp Lys Ile Thr Pro Gly
115 120 125

atg ctt atg gcg gca atg cgc atc aac att ccg acg att ttt gtc agc 432
Met Leu Met Ala Ala Met Arg Ile Asn Ile Pro Thr Ile Phe Val Ser
130 135 140

ggc gga ccg atg gcg gca gga aga aca agt tac ggg cga aaa atc tcc 480
Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr Gly Arg Lys Ile Ser
145 150 155 160

ctt tcc tca gta ttc gaa ggg gta ggc gcc tac caa gca ggg aaa atc 528
Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr Gln Ala Gly Lys Ile
165 170 175

aac gaa aac gag ctt caa gaa cta gag cag ttc gga tgc cca acg tgc 576
Asn Glu Asn Glu Leu Gln Glu Leu Glu Gln Phe Gly Cys Pro Thr Cys
180 185 190

ggg tct tgc tca ggc atg ttt acg gcg aac tca atg aac tgt ctg tca 624
Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Ser
195 200 205

gaa gca ctt ggt ctt gct ttg ccg ggt aat gga acc att ctg gca aca 672
Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr Ile Leu Ala Thr
210 215 220

tct ccg gaa cgc aaa gag ttt gtg aga aaa tcg gct gcg caa tta atg 720
Ser Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gln Leu Met
225 230 235 240

gaa acg att cgc aaa gat atc aaa ccg cgt gat att gtt aca gta aaa 768
Glu Thr Ile Arg Lys Asp Ile Lys Pro Arg Asp Ile Val Thr Val Lys
245 250 255

gcg att gat aac gcg ttt gca ctc gat atg gcg ctc gga ggt tct aca 816
Ala Ile Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr
260 265 270

aat acc gtt ctt cat acc ctt gcc ctt gca aac gaa gcc ggc gtt gaa 864
Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu
275 280 285

tac tct tta gaa cgc att aac gaa gtc gct gag cgc gtg ccg cac ttg 912
Tyr Ser Leu Glu Arg Ile Asn Glu Val Ala Glu Arg Val Pro His Leu
290 295 300

gct aag ctg gcg cct gca tcg gat gtg ttt att gaa gat ctt cac gaa 960
Ala Lys Leu Ala Pro Ala Ser Asp Val Phe Ile Glu Asp Leu His Glu
305 310 315 320

gcg ggc ggc gtt tca gcg gct ctg aat gag ctt tcg aag aaa gaa gga 1008
Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly
325 330 335

gcg ctt cat tta gat gcg ctg act gtt aca gga aaa act ctt gga gaa 1056
Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu
340 345 350

acc att gcc gga cat gaa gta aag gat tat gac gtc att cac ccg ctg 1104
Thr Ile Ala Gly His Glu Val Lys Asp Tyr Asp Val Ile His Pro Leu
355 360 365

gat caa cca ttc act gaa aag gga ggc ctt gct gtt tta ttc ggt aat 1152
Asp Gln Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn
370 375 380

cta gct ccg gac ggc gct atc att aaa aca ggc ggc gta cag aat ggg 1200
Leu Ala Pro Asp Gly Ala Ile Ile Lys Thr Gly Gly Val Gln Asn Gly
385 390 395 400

att aca aga cac gaa ggg ccg gct gtc gta ttc gat tct cag gac gag 1248
Ile Thr Arg His Glu Gly Pro Ala Val Val Phe Asp Ser Gln Asp Glu
405 410 415

gcg ctt gac ggc att atc aac cga aaa gta aaa gaa ggc gac gtt gtc 1296
Ala Leu Asp Gly Ile Ile Asn Arg Lys Val Lys Glu Gly Asp Val Val
420 425 430

atc atc aga tac gaa ggg cca aaa ggc gga cct ggc atg ccg gaa atg 1344
Ile Ile Arg Tyr Glu Gly Pro Lys Gly Gly Pro Gly Met Pro Glu Met
435 440 445

ctg gcg cca aca tcc caa atc gtt gga atg gga ctc ggg cca aaa gtg 1392
Leu Ala Pro Thr Ser Gln Ile Val Gly Met Gly Leu Gly Pro Lys Val
450 455 460

gca ttg att acg gac gga cgt ttt tcc gga gcc tcc cgt ggc ctc tca 1440
Ala Leu Ile Thr Asp Gly Arg Phe Ser Gly Ala Ser Arg Gly Leu Ser
465 470 475 480

atc ggc cac gta tca cct gag gcc gct gag ggc ggg ccg ctt gcc ttt 1488
Ile Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly Pro Leu Ala Phe
485 490 495

gtt gaa aac gga gac cat att atc gtt gat att gaa aaa cgc atc ttg 1536
Val Glu Asn Gly Asp His Ile Ile Val Asp Ile Glu Lys Arg Ile Leu
500 505 510

gat gta caa gtg cca gaa gaa gag tgg gaa aaa cga aaa gcg aac tgg 1584
Asp Val Gln Val Pro Glu Glu Glu Trp Glu Lys Arg Lys Ala Asn Trp
515 520 525

aaa ggt ttt gaa ccg aaa gtg aaa acc ggc tac ctg gca cgt tat tct 1632
Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser
530 535 540

aaa ctt gtg aca agt gcc aac acc ggc ggt att atg aaa atc 1674
Lys Leu Val Thr Ser Ala Asn Thr Gly Gly Ile Met Lys Ile
545 550 555



<210> 38 
<211> 558 
<212> PRT 

<213> Bacillus subtilis 
<400> 38 

Met Ala Glu Leu Arg Ser Asn Met Ile Thr Gln Gly Ile Asp Arg Ala
1 5 10 15

Pro His Arg Ser Leu Leu Arg Ala Ala Gly Val Lys Glu Glu Asp Phe
20 25 30

Gly Lys Pro Phe Ile Ala Val Cys Asn Ser Tyr Ile Asp Ile Val Pro
35 40 45

Gly His Val His Leu Gln Glu Phe Gly Lys Ile Val Lys Glu Ala Ile
50 55 60

Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr Ile Gly Val Asp
65 70 75 80

Asp Gly Ile Ala Met Gly His Ile Gly Met Arg Tyr Ser Leu Pro Ser
85 90 95

Arg Glu Ile Ile Ala Asp Ser Val Glu Thr Val Val Ser Ala His Trp
100 105 110

Phe Asp Gly Met Val Cys Ile Pro Asn Cys Asp Lys Ile Thr Pro Gly
115 120 125

Met Leu Met Ala Ala Met Arg Ile Asn Ile Pro Thr Ile Phe Val Ser
130 135 140

Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr Gly Arg Lys Ile Ser
145 150 155 160

Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr Gln Ala Gly Lys Ile
165 170 175

Asn Glu Asn Glu Leu Gln Glu Leu Glu Gln Phe Gly Cys Pro Thr Cys
180 185 190

Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Ser
195 200 205

Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr Ile Leu Ala Thr
210 215 220

Ser Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gln Leu Met
225 230 235 240

Glu Thr Ile Arg Lys Asp Ile Lys Pro Arg Asp Ile Val Thr Val Lys
245 250 255

Ala Ile Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr
260 265 270

Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu
275 280 285

Tyr Ser Leu Glu Arg Ile Asn Glu Val Ala Glu Arg Val Pro His Leu
290 295 300

Ala Lys Leu Ala Pro Ala Ser Asp Val Phe Ile Glu Asp Leu His Glu
305 310 315 320

Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly
325 330 335

Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu
340 345 350

Thr Ile Ala Gly His Glu Val Lys Asp Tyr Asp Val Ile His Pro Leu
355 360 365

Asp Gln Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn
370 375 380

Leu Ala Pro Asp Gly Ala Ile Ile Lys Thr Gly Gly Val Gln Asn Gly
385 390 395 400

Ile Thr Arg His Glu Gly Pro Ala Val Val Phe Asp Ser Gln Asp Glu
405 410 415

Ala Leu Asp Gly Ile Ile Asn Arg Lys Val Lys Glu Gly Asp Val Val
420 425 430

Ile Ile Arg Tyr Glu Gly Pro Lys Gly Gly Pro Gly Met Pro Glu Met
435 440 445

Leu Ala Pro Thr Ser Gln Ile Val Gly Met Gly Leu Gly Pro Lys Val
450 455 460

Ala Leu Ile Thr Asp Gly Arg Phe Ser Gly Ala Ser Arg Gly Leu Ser
465 470 475 480

Ile Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly Pro Leu Ala Phe
485 490 495

Val Glu Asn Gly Asp His Ile Ile Val Asp Ile Glu Lys Arg Ile Leu
500 505 510

Asp Val Gln Val Pro Glu Glu Glu Trp Glu Lys Arg Lys Ala Asn Trp
515 520 525

Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser
530 535 540

Lys Leu Val Thr Ser Ala Asn Thr Gly Gly Ile Met Lys Ile
545 550 555



<210> 39 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal
<222> (136)..(141)

<220>

<221> -10_signal
<222> (159)..(164)

<400> 39

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60

tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120

ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180

gcaaccccgc ctgt 194



<210> 40 
<211> 163 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal
<222> (113)..(118)

<220>

<221> -10_signal
<222> (136)..(141)

<400> 40 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 

tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 

atctacaagg tgtggtataa taatcttaac aacagcagga cgc 163 



<210> 41 
<211> 127 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal
<222> (34)..(39)

<220>

<221> -10_signal
<222> (58)..(63)

<220>

<221> -35_signal
<222> (75)..(80)

<220>

<221> -10_signal
<222> (98)..(103)

<400> 41

gaggaatcat agaattttgt caaaataatt ttattgacaa cgtcttatta acgttgatat 60
aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120
tgcaatg 127



<210> 42 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 42 

taaacatgag gaggagaaaa catg 24 

<210> 43
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : ribosome
binding site

<400> 43

attcgagaaa tggagagaat ataatatg 28

<210> 44
<211> 13
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : ribosome
binding site

<400> 44

agaaaggagg tga 13

<210> 45
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : ribosome
binding site

<400> 45

ttaagaaagg aggtgannnn atg 23



<210> 46
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : ribosome
binding site

<400> 46

ttagaaagga ggtgannnnn atg 23



<210> 47 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 47 

agaaaggagg tgannnnnnn atg 23

<210> 48 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 48 

agaaaggagg tgannnnnna tg 22

<210> 49 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 49 

ccctctagaa ggaggagaaa acatg 25

<210> 50 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 50 

ccctctagag gaggagaaaa catg 24

<210> 51 
<211> 23 
<212> DNA

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 51 

ttagaaagga ggatttaaat atg 23 

<210> 52
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : ribosome
binding site

<400> 52

ttagaaagga ggtttaatta atg 23

<210> 53
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : ribosome
binding site

<400> 53

ttagaaagga ggtgatttaa atg 23

<210> 54
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : ribosome
binding site

<400> 54

ttagaaagga ggtgtttaaa atg 23

<210> 55
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : ribosome
binding site

<400> 55

attcgagaaa ggaggtgaat ataatatg 28



<210> 56
<211> 27
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<400> 56 

attcgagaaa ggaggtgaat aataatg 27 

<210> 57 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 57 

attcgtagaa aggaggtgaa ttaatatg 28 



<210> 58 
<211> 3291 
<212> DNA 

<213> Bacillus subtilis 
<400> 58 



atggggacta atgtacaggt ggattcagca tctgccgaat gtacacagac gatgagcgga 60

gcattaatgc tgattgaatc attaaaaaaa gagaaagtag aaatgatctt cggttatccg 120

ggcggggctg tgcttccgat ttacgataag ctatacaatt cagggttggt acatatcctt 180

ccccgtcacg aacaaggagc aattcatgca gcggagggat acgcaagggt ctccggaaaa 240

ccgggtgtcg tcattgccac gtcagggccg ggagcgacaa accttgttac aggccttgct 300

gatgccatga ttgattcatt gccgttagtc gtctttacag ggcaggtagc aacctctgta 360

atcgggagcg atgcatttca ggaagcagac attttaggga ttacgatgcc agtaacaaaa 420

cacagctacc aggttcgcca gccggaagat ctgccgcgca tcattaaaga agcgttccat 480

attgcaacaa ctggaagacc cggacctgta ttgattgata ttccgaaaga tgtagcaaca 540

attgaaggag aattcagcta cgatcatgag atgaatctcc cgggatacca gccgacaaca 600

gagccgaatt atttgcagat ccgcaagctt gtggaagccg tgagcagtgc gaaaaaaccg 660

gtgatcctgg cgggtgcggg cgtactgcac ggaaaagcgt cagaagaatt aaaaaattat 720

gctgaacagc agcaaatccc tgtggcacac acccttttgg ggctcggagg cttcccggct 780

gaccatccgc ttttcctagg gatggcggga atgcacggta cttatacagc caatatggcc 840

cttcatgaat gtgatctatt aatcagtatc ggcgcccgtt ttgatgaccg tgtcacagga 900

aacctgaaac actttgccag aaacgcaaag atagcccaca tcgatattga tccagctgaa 960

atcggaaaaa tcatgaaaac acagattcct gtagtcggag acagcaaaat tgtcctgcag 1020

gagctgatca aacaagacgg caaacaaagc gattcaagcg aatggaaaaa acagctcgca 1080

gaatggaaag aagagtatcc gctctggtat gtagataatg aagaagaagg ttttaaacct 1140

cagaaattga ttgaatatat tcatcaattt acaaaaggag aggccattgt cgcaacggat 1200

gtaggccagc atcaaatgtg gtcagcgcaa ttttatccgt tccaaaaagc agataaatgg 1260

gtcacgtcag gcggacttgg aacgatggga ttcggtcttc cggcggcgat cggcgcacag 1320

ctggccgaaa aagatgctac tgttgtcgcg gttgtcggag acggcggatt ccaaatgacg 1380

cttcaagaac tcgatgttat tcgcgaatta aatcttccgg tcaaggtagt gattttaaat 1440

aacgcttgtc tcggaatggt cagacagtgg caggaaattt tctatgaaga acgttattca 1500

gaatctaaat tcgcttctca gcctgacttc gtcaaattgt ccgaagcata cggcattaaa 1560

ggcatcagaa tttcatcaga agcggaagca aaggaaaagc tggaagaggc attaacatca 1620

agagaacctg ttgtcattga cgtgcgggtt gccagcgaag aaaaagtatt cccgatggtg 1680

gctccgggga aagggctgca tgaaatggtg ggggtgaaac cttgaaaaga attatcacat 1740

tgactgtggt gaaccgctcc ggggtgttaa accggatcac cggtctattc acaaaaaggc 1800

attacaacat tgaaagcatt acagttggac acacagaaac agccggcgtt tccagaatca 1860

ccttcgtcgt tcatgttgaa ggtgaaaatg atgttgaaca gttaacgaaa cagctcaaca 1920

aacagattga tgtgctgaaa gtcacagaca tcacaaatca atcgattgtc cagagggagc 1980

tggccttaat caaggttgtc tccgcacctt caacaagaac agagattaat ggaatcatag 2040

aaccgtttag agcctctgtc gttgatgtca gcagagacag catcgttgtt caggtgacag 2100

gtgaatctaa caaaattgaa gcgcttattg agttattaaa accttatggc attaaagaaa 2160

tcgcgagaac aggtacaacg gcttttgcga ggggaaccag caaaaggcgt catccaataa 2220

aacaatatct attgtataaa acataacaag ggagagattg aaatggtaaa agtatattat 2280

aacggtgata tcaaagagaa cgtattggct ggaaaaacag tagcggttat cgggtacggt 2340

tcgcaaggcc acgcacatgc cctgaacctt aaagaaagcg gagtagacgt gatcgtcggt 2400

gttagacaag gaaaatcttt cactcaagcc caagaagacg gacataaagt attttcagta 2460

aaagaagcgg cagcccaagc cgaaatcatc atggttctgc ttccggatga gcagcagcaa 2520

aaagtatacg aagctgaaat caaagatgaa ttgacagcag gaaaatcatt agtattcgct 2580

catggattta acgtgcattt ccatcaaatt gttcctccgg cggatgtaga tgtattctta 2640

gtggccccta aaggcccggg acacttggta agaagaacat atgagcaagg agctggcgta 2700

cctgcattgt tcgcaatcta tcaagatgtg actggagaag caagagacaa agccctcgct 2760

tatgctaaag gaatcggcgg cgcaagagcg ggcgtattag aaacgacatt taaagaagaa 2820

acagaaacag atttgttcgg tgagcaagca gttctttgcg gcggattaag cgcgcttgtc 2880

aaagccggat ttgaaacctt aactgaagca ggttatcagc ctgaacttgc atacttcgag 2940

tgtcttcatg agctgaaatt aatcgtagac cttatgtacg aagaaggact tgcaggaatg 3000

agatattcaa tctctgacac agcacagtgg ggagatttcg tatcaggccc tcgcgttgtg 3060

gacgccaaag taaaagaatc tatgaaagaa gtattaaaag atatccaaaa cggtacattc 3120

gcaaaagagt ggatcgtcga aaaccaagta aaccgtcctc gtttcaacgc tatcaatgca 3180

agcgagaacg aacatcaaat cgaagtagtg ggaagaaagc ttcgtgaaat gatgccgttt 3240

gtgaaacaag gcaagaagaa ggaagcggtg gtctccgttg cgcaaaatta a 3291



<210> 59 
<211> 2363 
<212> DNA 

<213> Bacillus subtilis 

<220>
<221> CDS
<222> (242)..(1072)

<220>
<221> CDS
<222> (1077)..(1934)

<220>
<221> CDS
<222> (1939)..(2319)
<400> 59



ttggtacaag cccgttgatt 


ttggtatact 


tccattgggc 


agtatcgect 


gegaactgea 


60 


cctattatta aaatagatag 


acattgcagc 


agtctgeett 


gatccaaaaa 


aggactggga 


120 


cagagggatg aaactcgccg 


aactttagaa 


agtgaagaat 


ccttctcgtt 


gtaacggaag 


180 


gttttttggc ttgcagaaga 


aaacggcaga 


tcatctcctc 


taaacatgag 


gaggagaaaa 


240 


c atg aaa aca aaa ctg 


gat ttt eta aaa atg aag gag tct gaa gaa ccg 


289 



Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 
15 10 15 

att gtc atg ctg acc get tat gat tat ccg gca get aaa ctt get gaa 337 
He Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 
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20 25 30 

caa gcg gga gtt gac atg att tta gtc ggt gat tea ctt gga atg gtc 385 
Gin Ala Gly Val Asp Met lie Leu Val Gly Asp Ser Leu Gly Met Val 
35 40 45 

gtc etc ggc ctt gat tea act gtc ggt gtg aca gtt gcg gac atg ate 433 
Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met He 
50 55 60 

cat cat aca aaa gec gtt aaa agg ggt gcg ccg aat acc ttt att gtg 4 81 
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe He Val 
65 70 75 80 

aca gat atg ccg ttt atg tct tat cac ctg tct aag gaa gat acg ctg 529 
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 
85 90 95 

aaa aat gca gcg get ate gtt cag gaa age gga get gac gca ctg aag 577 
Lys Asn Ala Ala Ala He Val Gin Glu Ser Gly Ala Asp Ala Leu Lys 
100 105 L 110 

ctt gag ggc gga gaa ggc gtg ttt gaa tec att cgc gca ttg acg ctt 625 
Leu Glu Gly Gly Glu Gly Val Phe Glu Ser He Arg Ala Leu Thr Leu 
115 120 125 

gga ggc att cca gta gtc agt cac tta ggt ttg aca ccg cag tea gtc 673 
Gly Gly He Pro Val Val Ser His Leu Gly Leu Thr Pro Gin Ser Val 
130 135 140 

ggc gta ctg ggc ggc tat aaa gta cag ggc aaa gac gaa caa age gee 721 
Gly Val Leu Gly Gly Tyr Lys Val Gin Gly Lys Asp Glu Gin Ser Ala 
145 150 155 160 

aaa aaa tta ata gaa gac agt ata aaa tgc gaa gaa gca gga get atg 769 
Lys Lys Leu He Glu Asp Ser He Lys Cys Glu Glu Ala Gly Ala Met 
165 170 175 

atg ctt gtg ctg gaa tgt gtg ccg gca gaa etc aca gec aaa att gee 817 
Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys lie Ala 
180 185 190 

gag acg eta age ata ccg gtc att gga ate ggg get ggt gtg aaa gcg 865 
Glu Thr Leu Ser lie Pro Val lie Gly lie Gly Ala Gly Val Lys Ala 
195 200 205 

gac gga caa gtt etc gtt tat cat gat att ate ggc cac ggt gtt gag 913 
Asp Gly Gin Val Leu Val Tyr His Asp lie lie Gly His Gly Val Glu 
210 215 220 

aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa acc ate 961 
Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg lie Asp Glu Thr lie 
225 230 235 240 

gaa aca gca ate age gga tat gtt cag gat gta aga cat cgt get ttc 1009 
Glu Thr Ala lie Ser Gly Tyr Val Gin Asp Val Arg His Arg Ala Phe 
245 250 255 
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cct gaa caa aag cat tec ttt caa atg aac cag aca gtg ctt gac ggc 1057 
Pro Glu Gin Lys His Ser Phe Gin Met Asn Gin Thr Val Leu Asp Gly 
260 265 270 

ttg tac ggg gga aaa taag atg aga cag att act gat att tea cag ctg 1106 
Leu Tyr Gly Gly Lys Met Arg Gin lie Thr Asp lie Ser Gin Leu 

275 280 285 

aaa gaa gec ata aaa caa tac cat tea gag ggc aag tea ate gga ttt 1154 
Lys Glu Ala He Lys Gin Tyr His Ser Glu Gly Lys Ser He Gly Phe 
290 295 300 

gtt ccg acg atg ggg ttt ctg cat gag ggg cat tta ace tta gca gac 1202 
Val Pro Thr Met Gly Phe Leu His Glu Gly His Leu Thr Leu Ala Asp 
305 310 315 

aaa gca aga caa gaa aac gac gec gtt att atg agt att ttt gtg aat 1250 
Lys Ala Arg Gin Glu Asn Asp Ala Val He Met Ser He Phe Val Asn 
320 325 330 335 

cct gca caa ttc ggc cct aat gaa gat ttt gaa gca tat ccg cgc gat 1298 
Pro Ala Gin Phe Gly Pro Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp 
340 345 350 

att gag egg gat gca get ctt gca gaa aac gec gga gtc gat att ctt 134 6 
He Glu Arg Asp Ala Ala Leu Ala Glu Asn Ala Gly Val Asp He Leu 
355 360 365 

ttt acg cca gat get cat gat atg tat ccc ggt gaa aag aat gtc acg 1394 
Phe Thr Pro Asp Ala His Asp Met Tyr Pro Gly Glu Lys Asn Val Thr 
370 375 380 

att cat gta gaa aga cgc aca gac gtg tta tgc ggg cgc tea aga gaa 1442 
lie His Val Glu Arg Arg Thr Asp Val Leu Cys Gly Arg Ser Arg Glu 
385 390 395 

gga cat ttt gac ggg gtc gcg atc gta ctg acg aag ctt ttc aat cta 1490
Gly His Phe Asp Gly Val Ala Ile Val Leu Thr Lys Leu Phe Asn Leu
400 405 410 415 

gtc aag ccg act cat gec tat ttc ggt tta aaa gat gcg cag cag gta 1538 
Val Lys Pro Thr Arg Ala Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val 
420 425 430 

get gtt gtt gat ggg tta ate age gac ttc ttc atg gat att gaa ttg 1586 
Ala Val Val Asp Gly Leu He Ser Asp Phe Phe Met Asp He Glu Leu 
435 440 445 

gtt cct gtc gat acg gtc aga gag gaa gac ggc tta gcc aaa agc tct 1634
Val Pro Val Asp Thr Val Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser
450 455 460

cgc aat gta tac tta aca get gag gaa aga aaa gaa gcg cct aag ctg 1682 
Arg Asn Val Tyr Leu Thr Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu 
465 470 475 



tat cgg gcc ctt caa aca agt gcg gaa ctt gtc caa gcc ggt gaa aga 1730
Tyr Arg Ala Leu Gln Thr Ser Ala Glu Leu Val Gln Ala Gly Glu Arg
480 485 490 495

gat cct gaa gcg gtg ata aaa get gca aaa gat ate att gaa acg act 1778 
Asp Pro Glu Ala Val He Lys Ala Ala Lys Asp He He Glu Thr Thr 
500 505 510 

age gga ace ata gac tat gta gag ctt tat tec tat ccg gaa etc gag 1826 
Ser Gly Thr He Asp Tyr Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu 
515 520 525 

cct gtg aat gaa att get gga aag atg att etc get gtt gca gtt get 1874 
Pro Val Asn Glu He Ala Gly Lys Met He Leu Ala Val Ala Val Ala 
530 535 540 

ttt tea aaa gcg cgt tta ata gat aat ate att att gat att cga gaa 1922 
Phe Ser Lys Ala Arg Leu He Asp Asn lie lie lie Asp lie Arg Glu 
545 550 555 

atg gag aga ata taat atg tat cga aca atg atg age ggc aaa ctt cac 1971 
Met Glu Arg lie Met Tyr Arg Thr Met Met Ser Gly Lys Leu His 

560 565 570 

agg gca act gtt acg gaa gca aac ctg aac tat gtg gga agc att aca 2019
Arg Ala Thr Val Thr Glu Ala Asn Leu Asn Tyr Val Gly Ser Ile Thr
575 580 585 590 

att gat gaa gat etc att gat get gtg gga atg ctt cct aat gaa aaa 2067 
lie Asp Glu Asp Leu lie Asp Ala Val Gly Met Leu Pro Asn Glu Lys 
595 600 605 

gta caa att gtg aat aat aat aat gga gca cgt ctt gaa acg tat att 2115 
Val Gin lie Val Asn Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr lie 
610 615 620 

att cct ggt aaa egg gga age ggc gtc ata tgc tta aac ggt gca gee 2163 
He Pro Gly Lys Arg Gly Ser Gly Val lie Cys Leu Asn Gly Ala Ala 
625 630 635 

gca cgc ctt gtg cag gaa gga gat aag gtc att att att tcc tac aaa 2211
Ala Arg Leu Val Gln Glu Gly Asp Lys Val Ile Ile Ile Ser Tyr Lys
640 645 650

atg atg tct gat caa gaa gcg gca agc cat gag ccg aaa gtg gct gtt 2259
Met Met Ser Asp Gln Glu Ala Ala Ser His Glu Pro Lys Val Ala Val
655 660 665 670

ctg aat gat caa aac aaa att gaa caa atg ctg ggg aac gaa cca gee 2307 
Leu Asn Asp Gin Asn Lys lie Glu Gin Met Leu Gly Asn Glu Pro Ala 
675 680 685 

cgt aca att ttg tagaagaaaa gcccccttta tcgggggttt tcttttaaga tttt 2363 
Arg Thr Ile Leu
690



<210> 60
<211> 293
<212> PRT
<213> Bacillus subtilis



<400> 60 

Met Ser Ile Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu
1 5 10 15

Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr lie Tyr He Pro Leu 
20 25 30 

Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys 
35 40 45 

His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys He Pro Phe 
50 55 60 

He He Gly He Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala 
65 70 75 80 

Arg Ile Leu Gln Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val
85 90 95 

Ser Leu He Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys 
100 105 110 

Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val 
115 120 125 

Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser 
130 135 140 

Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly 
145 150 155 160 

Val Phe Glu Val Val Glu Gin Ala Asp He Val He He Glu Gly lie 
165 170 175 

Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg 
180 185 190 

lie Phe Val Ser Asp Phe Phe Asp Phe Ser He Tyr Val Asp Ala Glu 
195 200 205 

Glu Ser Arg lie Phe Thr Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg 
210 215 220 

Glu Thr Ala Phe Gln Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp
225 230 235 240

Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser He Trp Glu Ser 
245 250 255 

Val Asn Arg Pro Asn Leu Tyr Glu Asn Ile Leu Pro Thr Lys Phe Arg
260 265 270 

Ser Asp Leu lie Leu Arg Lys Gly Asp Gly His Lys Val Glu Glu Val 
275 280 285 



Leu Val Arg Arg Val
290



<210> 61 
<211> 281 
<212> PRT 

<213> Bacillus subtilis 
<400> 61 

Met Glu Gly Leu Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr Ile
1 5 10 15

Tyr He Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala 
20 25 30 

Glu Arg Asn Lys His Val Asn Val Phe Leu Lys His Pro His Ser Ala 
35 40 45 

Lys He Pro Phe He He Gly He Ala Gly Ser Val Ala Val Gly Lys 
50 55 60 

Ser Thr Thr Ala Arg lie Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp 
65 70 75 80 

Arg Pro Lys Val Ser Leu He Thr Thr Asp Gly Phe Leu Phe Pro Thr 
85 90 95 

Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu 
100 105 110 

Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser 
115 120 125 

Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp 
130 135 140 

Arg Glu Glu Gly Val Phe Glu Val Val Glu Gin Ala Asp He Val He 
145 150 155 160 

He Glu Gly He Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg 
165 170 175 

Glu Asn Pro Arg He Phe Val Ser Asp Phe Phe Asp Phe Ser He Tyr 
180 185 190 

Val Asp Ala Glu Glu Ser Arg He Phe Thr Trp Tyr Leu Glu Arg Phe 
195 200 205 

Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His 
210 215 220 

Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser 
225 230 235 240 

He Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn He Leu Pro 
245 250 255 

Thr Lys Phe Arg Ser Asp Leu Ile Leu Arg Lys Gly Asp Gly His Lys
260 265 270

Val Glu Glu Val Leu Val Arg Arg Val 
275 280 



<210> 62 
<211> 1092 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(1089)

<400> 62 

atg act aaa caa aca att cgc gtt gaa ttg aca tca aca aaa aaa ccg 48
Met Thr Lys Gln Thr Ile Arg Val Glu Leu Thr Ser Thr Lys Lys Pro
1 5 10 15

aaa cca gac cca aat cag ctt teg ttc gga aga gtg ttt aca gac cac 96 
Lys Pro Asp Pro Asn Gin Leu Ser Phe Gly Arg Val Phe Thr Asp His 
20 25 30 

atg ttt gta atg gac tat gec gca gat aaa ggt tgg tac gat cca aga 144 
Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg 
35 40 45 

ate att cct tat caa ccc tta tea atg gat cca act gca atg gtc tat 192 
lie lie Pro Tyr Gin Pro Leu Ser Met Asp Pro Thr Ala Met Val Tyr 
50 55 60 

cac tac ggc caa ace gtg ttt gaa ggg tta aag get tac gtg tea gag 240 
His Tyr Gly Gin Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu 
65 70 75 80 

gat gac cat gtt ctg ctt ttc aga ccg gaa aaa aat atg gaa cgc ctg 288 
Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu 
85 90 95 

aat caa tca aac gac cgc ctc tgc atc ccg caa att gat gaa gaa cag 336
Asn Gln Ser Asn Asp Arg Leu Cys Ile Pro Gln Ile Asp Glu Glu Gln
100 105 110 

gtt ctt gaa ggc tta aag cag ctt gtc gca att gat aaa gac tgg att 384 
Val Leu Glu Gly Leu Lys Gin Leu Val Ala He Asp Lys Asp Trp He 
115 120 125 

cca aat gcg gag ggc acg tcc ctt tac atc cgt ccg ttc atc atc gca 432
Pro Asn Ala Glu Gly Thr Ser Leu Tyr Ile Arg Pro Phe Ile Ile Ala
130 135 140 

ace gag cct ttc ctt ggt gtt gcg gca tct cat acg tat aag etc ttg 480 
Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu 
145 150 155 160 

atc att ctt tct ccg gtc ggc tct tat tac aaa gaa ggc att aag ccg 528
Ile Ile Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly Ile Lys Pro
165 170 175

gtc aaa ate get gtt gaa agt gaa ttt gtc cgt gcg gta aaa ggc gga 576 

Val Lys He Ala Val Glu Ser Glu Phe Val Arg Ala Val Lys Gly Gly 

180 185 190 

aca gga aat gee aaa ace gca gga aac tat get tea age tta aaa gcg 624 

Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala 

195 200 205 

cag cag gta gee gaa gag aaa gga ttt tct caa gta etc tgg ctg gac 672 

Gin Gin Val Ala Glu Glu Lys Gly Phe Ser Gin Val Leu Trp Leu Asp 

210 215 220 

ggc att gag aag aaa tac ate gaa gaa gtc gga age atg aac ate ttc 720 

Gly He Glu Lys Lys Tyr He Glu Glu Val Gly Ser Met Asn He Phe 

225 230 235 240 

ttc aaa ate aac ggt gaa ate gta aca ccg atg ctg aac ggg age ate 768 

Phe Lys He Asn Gly Glu He Val Thr Pro Met Leu Asn Gly Ser He 

245 250 255 

ctg gaa ggc att acg cgc aat tea gtc ate gee ttg ctt aag cat tgg 816 

Leu Glu Gly lie Thr Arg Asn Ser Val He Ala Leu Leu Lys His Trp 

260 265 270 

ggc ctt caa gtt tea gaa cga aaa att gcg ate gat gag gtc ate caa 864 

Gly Leu Gin Val Ser Glu Arg Lys lie Ala lie Asp Glu Val lie Gin 

275 280 285 

gcc cat aaa gac ggc atc ctg gaa gaa gcc ttc gga aca ggt aca gca 912

Ala His Lys Asp Gly Ile Leu Glu Glu Ala Phe Gly Thr Gly Thr Ala

290 295 300 

get gtt att tec cca gtc ggc gag ctg ate tgg cag gat gaa aca ctt 960 

Ala Val lie Ser Pro Val Gly Glu Leu He Trp Gin Asp Glu Thr Leu 

305 310 315 320 

tcg atc aac aac ggt gaa aca gga gaa atc gca aaa aaa cta tat gac 1008

Ser Ile Asn Asn Gly Glu Thr Gly Glu Ile Ala Lys Lys Leu Tyr Asp

325 330 335

acg att aca ggc att caa aaa ggc get gtc gca gac gaa ttc gga tgg 1056 

Thr lie Thr Gly lie Gin Lys Gly Ala Val Ala Asp Glu Phe Gly Trp 

340 345 350 

acg ace gaa gtc gca gcg ctg act gaa age aag taa 1092 

Thr Thr Glu Val Ala Ala Leu Thr Glu Ser Lys 
355 360 



<210> 63 
<211> 363 
<212> PRT 

<213> Bacillus subtilis 



<400> 63 

Met Thr Lys Gln Thr Ile Arg Val Glu Leu Thr Ser Thr Lys Lys Pro
1 5 10 15



Lys Pro Asp Pro Asn Gin Leu Ser Phe Gly Arg Val Phe Thr Asp His 
20 25 30 

Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg 
35 40 45 

lie He Pro Tyr Gin Pro Leu Ser Met Asp Pro Thr Ala Met Val Tyr 
50 55 60 

His Tyr Gly Gin Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu 
65 70 75 80 

Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu 
85 90 95 

Asn Gin Ser Asn Asp Arg Leu Cys He Pro Gin He Asp Glu Glu Gin 
100 105 110 

Val Leu Glu Gly Leu Lys Gin Leu Val Ala He Asp Lys Asp Trp He 
115 120 125 

Pro Asn Ala Glu Gly Thr Ser Leu Tyr He Arg Pro Phe He He Ala 
130 135 140 

Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu 
145 150 155 160 

He He Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly He Lys Pro 
165 170 175 

Val Lys lie Ala Val Glu Ser Glu Phe Val Arg Ala Val Lys Gly Gly 
180 185 190 

Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala 
195 200 205 

Gin Gin Val Ala Glu Glu Lys Gly Phe Ser Gin Val Leu Trp Leu Asp 
210 215 220 

Gly He Glu Lys Lys Tyr He Glu Glu Val Gly Ser Met Asn lie Phe 
225 230 235 240 

Phe Lys He Asn Gly Glu lie Val Thr Pro Met Leu Asn Gly Ser lie 
245 250 255 

Leu Glu Gly lie Thr Arg Asn Ser Val lie Ala Leu Leu Lys His Trp 
260 265 270 

Gly Leu Gin Val Ser Glu Arg Lys lie Ala lie Asp Glu Val lie Gin 
275 280 285 



Ala His Lys Asp Gly lie Leu Glu Glu Ala Phe Gly Thr Gly Thr Ala 
290 295 300 



Ala Val Ile Ser Pro Val Gly Glu Leu Ile Trp Gln Asp Glu Thr Leu
305 310 315 320

Ser Ile Asn Asn Gly Glu Thr Gly Glu Ile Ala Lys Lys Leu Tyr Asp
325 330 335 

Thr He Thr Gly He Gin Lys Gly Ala Val Ala Asp Glu Phe Gly Trp 
340 345 350 

Thr Thr Glu Val Ala Ala Leu Thr Glu Ser Lys 
355 360 



<210> 64 

<211> 1071 

<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(1068)

<400> 64 

ttg aat aag ctt att gaa cga gaa aaa act gta tat tat aag gaa aag 48 

Met Asn Lys Leu Ile Glu Arg Glu Lys Thr Val Tyr Tyr Lys Glu Lys

1 5 10 15

ccc gac ccg tct tec ttg ggg ttt gga caa tat ttt aca gat tat atg 96 
Pro Asp Pro Ser Ser Leu Gly Phe Gly Gin Tyr Phe Thr Asp Tyr Met 
20 25 30 

ttt gtg atg gac tac gaa gag ggg att gga tgg cat cat ccg aga att 144
Phe Val Met Asp Tyr Glu Glu Gly Ile Gly Trp His His Pro Arg Ile
35 40 45 

gcg ccg tac gca ccg ctt acg ctt gat ccg tct tea tct gtt ttt cat 192 
Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His 
50 55 60 

tac ggc cag gct gtt ttt gaa gga tta aaa gca tac aga aca gac gac 240
Tyr Gly Gln Ala Val Phe Glu Gly Leu Lys Ala Tyr Arg Thr Asp Asp
65 70 75 80 

ggc agg gtg ctg ctg ttc cgt ccg gat caa aat ate aaa egg ctg aac 288 
Gly Arg Val Leu Leu Phe Arg Pro Asp Gin Asn He Lys Arg Leu Asn 
85 90 95 

aga teg tgt gag cgc atg age atg ccc cct tta gac gaa gag ctg gtg 336 
Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu Glu Leu Val 
100 105 110 

ctt gag gca ttg acg caa tta gtt gag ctg gag aaa gat tgg gtt cca 384 
Leu Glu Ala Leu Thr Gin Leu Val Glu Leu Glu Lys Asp Trp Val Pro 
115 120 125 

aag gaa aaa gga acg tca ctg tat att cgt cct ttt gtc att gcc aca 432
Lys Glu Lys Gly Thr Ser Leu Tyr Ile Arg Pro Phe Val Ile Ala Thr
130 135 140



gaa ccg agt ctc ggt gtg aag gca tcc agg agc tat aca ttt atg atc 480
Glu Pro Ser Leu Gly Val Lys Ala Ser Arg Ser Tyr Thr Phe Met Ile
145 150 155 160 

gtg ctt teg cct gtc ggc tec tat tat ggc gac gat cag ctg aag ccg 528 
Val Leu Ser Pro Val Gly Ser Tyr Tyr Gly Asp Asp Gin Leu Lys Pro 
165 170 175 

gtt aga ate tat gtc gaa gat gag tat gtg agg gcg gtc aac gga gga 576 
Val Arg He Tyr Val Glu Asp Glu Tyr Val Arg Ala Val Asn Gly Gly 
180 185 190 

gtc ggg ttt gca aaa acg get gga aac tat gec gee agt ctt cag gca 624 
Val Gly Phe Ala Lys Thr Ala Gly Asn Tyr Ala Ala Ser Leu Gin Ala 
195 200 205 

cag egg aaa gcg aat gaa ctg ggc tat gac cag gta ctg tgg ctg gac 672 
Gin Arg Lys Ala Asn Glu Leu Gly Tyr Asp Gin Val Leu Trp Leu Asp 
210 215 220 

gcc atc gaa aag aaa tat gtg gaa gaa gta ggg agc atg aac atc ttt 720
Ala Ile Glu Lys Lys Tyr Val Glu Glu Val Gly Ser Met Asn Ile Phe
225 230 235 240

ttc gtc ata aac ggg gaa get gtc aca cct get tta age gga age att 768 
Phe Val He Asn Gly Glu Ala Val Thr Pro Ala Leu Ser Gly Ser lie 
245 250 255 

tta age ggg gtt aca cgt gcg tct gcg att gaa ttg att cga age tgg 816 
Leu Ser Gly Val Thr Arg Ala Ser Ala He Glu Leu He Arg Ser Trp 
260 265 270 

ggc att ccg gtt cgt gaa gag aga ata tcg att gat gag gtg tat gcg 864
Gly Ile Pro Val Arg Glu Glu Arg Ile Ser Ile Asp Glu Val Tyr Ala
275 280 285 

gee tct gca cgc gga gaa ttg aca gag gtc ttt ggc aca ggc acg gca 912 
Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala 
290 295 300 

gca gtc gtt acg cct gtc ggt gaa ctc aac atc cat gga aaa acg gtg 960
Ala Val Val Thr Pro Val Gly Glu Leu Asn Ile His Gly Lys Thr Val
305 310 315 320 

att gta ggc gac ggg caa ate ggg gac etc teg aaa aag ctg tat gaa 1008 
lie Val Gly Asp Gly Gin lie Gly Asp Leu Ser Lys Lys Leu Tyr Glu 
325 330 335 

acg ata aca gat att cag ctt ggc aag gta aaa ggc ccg ttt aac tgg 1056 
Thr lie Thr Asp lie Gin Leu Gly Lys Val Lys Gly Pro Phe Asn Trp 
340 345 350 

aca gtg gaa gtg tga 1071 
Thr Val Glu Val
355



<210> 65
<211> 356
<212> PRT

<213> Bacillus subtilis 
<400> 65 

Met Asn Lys Leu Ile Glu Arg Glu Lys Thr Val Tyr Tyr Lys Glu Lys
1 5 10 15

Pro Asp Pro Ser Ser Leu Gly Phe Gly Gin Tyr Phe Thr Asp Tyr Met 
20 25 30 

Phe Val Met Asp Tyr Glu Glu Gly He Gly Trp His His Pro Arg He 
35 40 45 

Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His 
50 55 60 

Tyr Gly Gin Ala Val Phe Glu Gly Leu Lys Ala Tyr Arg Thr Asp Asp 
65 70 75 80 

Gly Arg Val Leu Leu Phe Arg Pro Asp Gin Asn lie Lys Arg Leu Asn 
85 90 95 

Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu Glu Leu Val 
100 105 110 

Leu Glu Ala Leu Thr Gin Leu Val Glu Leu Glu Lys Asp Trp Val Pro 
115 120 125 

Lys Glu Lys Gly Thr Ser Leu Tyr He Arg Pro Phe Val He Ala Thr 
130 135 140 

Glu Pro Ser Leu Gly Val Lys Ala Ser Arg Ser Tyr Thr Phe Met Ile
145 150 155 160

Val Leu Ser Pro Val Gly Ser Tyr Tyr Gly Asp Asp Gin Leu Lys Pro 
165 170 175 

Val Arg He Tyr Val Glu Asp Glu Tyr Val Arg Ala Val Asn Gly Gly 
180 185 190 

Val Gly Phe Ala Lys Thr Ala Gly Asn Tyr Ala Ala Ser Leu Gin Ala 

195 200 205 

Gin Arg Lys Ala Asn Glu Leu Gly Tyr Asp Gin Val Leu Trp Leu Asp 
210 215 220 

Ala He Glu Lys Lys Tyr Val Glu Glu Val Gly Ser Met Asn He Phe 
225 230 235 240 

Phe Val He Asn Gly Glu Ala Val Thr Pro Ala Leu Ser Gly Ser He 
245 250 255 

Leu Ser Gly Val Thr Arg Ala Ser Ala He Glu Leu He Arg Ser Trp 
260 265 270 

Gly Ile Pro Val Arg Glu Glu Arg Ile Ser Ile Asp Glu Val Tyr Ala
275 280 285

Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala
290 295 300 

Ala Val Val Thr Pro Val Gly Glu Leu Asn He His Gly Lys Thr Val 
305 310 315 320 

He Val Gly Asp Gly Gin He Gly Asp Leu Ser Lys Lys Leu Tyr Glu 
325 330 335 

Thr He Thr Asp He Gin Leu Gly Lys Val Lys Gly Pro Phe Asn Trp 
340 345 350 

Thr Val Glu Val 
355 



<210> 66 
<211> 1428 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(1425)

<400> 66 

atg tta aac ggc caa aaa gaa tat cgc gtg gaa aaa gac ttc ctt ggg 48

Met Leu Asn Gly Gln Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly

1 5 10 15 

gaa aaa caa att gaa gca gat gtt tat tac gga att cag acg etc cgt 96 
Glu Lys Gin He Glu Ala Asp Val Tyr Tyr Gly He Gin Thr Leu Arg 
20 25 30 

get tct gaa aat ttt ccg ate aca gga tac aaa ate cat gag gaa atg 144 
Ala Ser Glu Asn Phe Pro He Thr Gly Tyr Lys He His Glu Glu Met 
35 40 45 

att aac gca ctg gcg att gtg aaa aaa get gcg get ctt gee aac atg 192 
He Asn Ala Leu Ala He Val Lys Lys Ala Ala Ala Leu Ala Asn Met 
50 55 60 

gac gtg aaa egg ctg tat gaa gga att ggc caa get ate gta caa gee 240 
Asp Val Lys Arg Leu Tyr Glu Gly He Gly Gin Ala He Val Gin Ala 
65 70 75 80 

gct gac gag att ctg gaa ggc aag tgg cac gat cag ttt atc gtc gat 288
Ala Asp Glu Ile Leu Glu Gly Lys Trp His Asp Gln Phe Ile Val Asp
85 90 95

ccg att cag ggc ggt gee gga act tct atg aac atg aac gcg aat gag 336 
Pro He Gin Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu 
100 105 110 



gtt atc gga aac cgg gcg ctt gaa atc atg gga cat aaa aag gga gat 384
Val Ile Gly Asn Arg Ala Leu Glu Ile Met Gly His Lys Lys Gly Asp
115 120 125

tat atc cat tta agt cca aac aca cat gtg aac atg tca cag tct cag 432

Tyr Ile His Leu Ser Pro Asn Thr His Val Asn Met Ser Gln Ser Gln
130 135 140 

aac gat gtg ttc ccg act get ate cat att tec aca ttg aag etc tta 480 

Asn Asp Val Phe Pro Thr Ala lie His lie Ser Thr Leu Lys Leu Leu 
145 150 155 160 

gaa aaa ctg ctg aaa aca atg gaa gat atg cat agt gtg ttt aaa caa 528 

Glu Lys Leu Leu Lys Thr Met Glu Asp Met His Ser Val Phe Lys Gin 

165 170 175 

aaa gca cag gag ttt cac tct gtt att aaa atg ggc egg aca cac ctt 576 

Lys Ala Gin Glu Phe His Ser Val lie Lys Met Gly Arg Thr His Leu 
180 185 190 

caa gat gcg gtt ccg ate cgt ctt ggc cag gaa ttc gaa get tac age 624 

Gin Asp Ala Val Pro He Arg Leu Gly Gin Glu Phe Glu Ala Tyr Ser 
195 200 205 

cgt gtt ctc gag cgt gat atc aaa cga atc aag caa tcg cgc cag cac 672

Arg Val Leu Glu Arg Asp Ile Lys Arg Ile Lys Gln Ser Arg Gln His
210 215 220

ctg tat gaa gtc aac atg ggc gca act get gtt ggt aca ggg ctg aac 720 

Leu Tyr Glu Val Asn Met Gly Ala Thr Ala Val Gly Thr Gly Leu Asn 
225 230 235 240 

gct gat cct gaa tat atc aaa cag gta gta aag cac ctt gct gat att 768

Ala Asp Pro Glu Tyr Ile Lys Gln Val Val Lys His Leu Ala Asp Ile

245 250 255 

age ggg ctt cct ctt gtc ggc get gat cat ctt gtt gat gcg aca caa 816 

Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gin 
260 265 270 

aat aca gat gee tat aca gag gta tea get tea tta aaa gtc tgc atg 864 

Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Met 
275 280 285 

atg aac atg teg aag ate gca aac gac ctg cgc tta atg gcg teg gga 912 

Met Asn Met Ser Lys He Ala Asn Asp Leu Arg Leu Met Ala Ser Gly 
290 295 300 

ccg cgc gec gga ctt gcg gaa att tct ctg cct gca cgt cag ccg ggt 960 

Pro Arg Ala Gly Leu Ala Glu He Ser Leu Pro Ala Arg Gin Pro Gly 
305 310 315 320 

tea tct att atg ccg ggg aaa gtc aat ccg gtt atg gcg gag ctg ate 1008 

Ser Ser He Met Pro Gly Lys Val Asn Pro Val Met Ala Glu Leu He 

325 330 335 

aac caa att gcg ttc cag gtt ate gga aat gac aat aca ate tgc ctt 1056 

Asn Gin He Ala Phe Gin Val lie Gly Asn Asp Asn Thr He Cys Leu 
340 345 350 

gct tca gaa gcc ggc cag ctt gag ttg aac gtc atg gag ccc gtg ctt 1104

Ala Ser Glu Ala Gly Gln Leu Glu Leu Asn Val Met Glu Pro Val Leu
355 360 365

gtc ttt aat ttg ctt caa tec ate age ate atg aac aac ggc ttc cgt 1152 
Val Phe Asn Leu Leu Gin Ser lie Ser lie Met Asn Asn Gly Phe Arg 
370 375 380 

teg ttc act gac aac tgc tta aaa ggc att gaa gee aac gaa aag cgt 1200 
Ser Phe Thr Asp Asn Cys Leu Lys Gly lie Glu Ala Asn Glu Lys Arg 
385 390 395 400 

atg aag caa tac gta gaa aaa age gca ggc gtg ate aca get gtc aat 1248 
Met Lys Gin Tyr Val Glu Lys Ser Ala Gly Val He Thr Ala Val Asn 
405 410 415 

ccg cat ctt ggg tat gaa gcg gca get aga att gec agg gaa gca att 1296 
Pro His Leu Gly Tyr Glu Ala Ala Ala Arg He Ala Arg Glu Ala He 
420 425 430 

atg aca ggg caa tct gtc egg gat ctt tgt ctg cag cat gat gtg ctg 1344 
Met Thr Gly Gin Ser Val Arg Asp Leu Cys Leu Gin His Asp Val Leu 
435 440 445 

act gaa gaa gaa ttg gat att att tta aac cca tat gag atg ace aaa 1392 
Thr Glu Glu Glu Leu Asp He He Leu Asn Pro Tyr Glu Met Thr Lys 
450 455 460 

cca ggt ate gca ggg aaa gaa eta tta gaa aaa taa 1428 
Pro Gly He Ala Gly Lys Glu Leu Leu Glu Lys 
465 470 475 



<210> 67 
<211> 475 
<212> PRT 

<213> Bacillus subtilis 
<400> 67 

Met Leu Asn Gly Gln Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly
1 5 10 15

Glu Lys Gin He Glu Ala Asp Val Tyr Tyr Gly He Gin Thr Leu Arg 
20 25 30 

Ala Ser Glu Asn Phe Pro He Thr Gly Tyr Lys He His Glu Glu Met 
35 40 45 

He Asn Ala Leu Ala He Val Lys Lys Ala Ala Ala Leu Ala Asn Met 
50 55 60 

Asp Val Lys Arg Leu Tyr Glu Gly He Gly Gin Ala lie Val Gin Ala 
65 70 75 80 

Ala Asp Glu He Leu Glu Gly Lys Trp His Asp Gin Phe He Val Asp 
85 90 95 

Pro Ile Gln Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu
100 105 110

Val Ile Gly Asn Arg Ala Leu Glu Ile Met Gly His Lys Lys Gly Asp
115 120 125 

Tyr He His Leu Ser Pro Asn Thr His Val Asn Met Ser Gin Ser Gin 
130 135 140 

Asn Asp Val Phe Pro Thr Ala He His He Ser Thr Leu Lys Leu Leu 
145 150 155 160 

Glu Lys Leu Leu Lys Thr Met Glu Asp Met His Ser Val Phe Lys Gin 
165 170 175 

Lys Ala Gin Glu Phe His Ser Val He Lys Met Gly Arg Thr His Leu 
180 185 190 

Gin Asp Ala Val Pro He Arg Leu Gly Gin Glu Phe Glu Ala Tyr Ser 
195 200 205 

Arg Val Leu Glu Arg Asp He Lys Arg He Lys Gin Ser Arg Gin His 
210 215 220 

Leu Tyr Glu Val Asn Met Gly Ala Thr Ala Val Gly Thr Gly Leu Asn 
225 230 235 240 

Ala Asp Pro Glu Tyr He Lys Gin Val Val Lys His Leu Ala Asp lie 
245 250 255 

Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gin 
260 265 270

Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Met 
275 280 285 

Met Asn Met Ser Lys He Ala Asn Asp Leu Arg Leu Met Ala Ser Gly 
290 295 300 

Pro Arg Ala Gly Leu Ala Glu He Ser Leu Pro Ala Arg Gin Pro Gly 
305 310 315 320 

Ser Ser He Met Pro Gly Lys Val Asn Pro Val Met Ala Glu Leu lie 
325 330 335 

Asn Gin lie Ala Phe Gin Val He Gly Asn Asp Asn Thr lie Cys Leu 
340 345 350 

Ala Ser Glu Ala Gly Gin Leu Glu Leu Asn Val Met Glu Pro Val Leu 
355 360 365 

Val Phe Asn Leu Leu Gin Ser lie Ser lie Met Asn Asn Gly Phe Arg 
370 375 380 

Ser Phe Thr Asp Asn Cys Leu Lys Gly lie Glu Ala Asn Glu Lys Arg 
385 390 395 400 



Met Lys Gin Tyr Val Glu Lys Ser Ala Gly Val lie Thr Ala Val Asn 
405 410 415 



Pro His Leu Gly Tyr Glu Ala Ala Ala Arg Ile Ala Arg Glu Ala Ile
420 425 430

Met Thr Gly Gin Ser Val Arg Asp Leu Cys Leu Gin His Asp Val Leu 
435 440 445 

Thr Glu Glu Glu Leu Asp lie lie Leu Asn Pro Tyr Glu Met Thr Lys 
450 455 460 

Pro Gly lie Ala Gly Lys Glu Leu Leu Glu Lys 
465 470 475 



<210> 68 

<211> 768 

<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(765)

<400> 68 

atg aaa cga gaa agc aac att caa gtg ctc agc cgt ggt caa aaa gat 48

Met Lys Arg Glu Ser Asn Ile Gln Val Leu Ser Arg Gly Gln Lys Asp
1 5 10 15 

cag cct gtg age cag att tat caa gta tea aca atg act tct eta tta 96 
Gin Pro Val Ser Gin He Tyr Gin Val Ser Thr Met Thr Ser Leu Leu 
20 25 30 

gac gga gta tat gac gga gat ttt gaa ctg tea gag att ccg aaa tat 144 
Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu He Pro Lys Tyr 
35 40 45 

gga gac ttc ggt ate gga ace ttt aac aag ctt gac gga gag ctg att 192 
Gly Asp Phe Gly He Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu He 
50 55 60 

ggg ttt gac ggc gaa ttt tac cgt ctt cgc tea gac gga acc gcg aca 240 
Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr 
65 70 75 80 

ccg gtc caa aat gga gac cgt tea ccg ttc tgt tea ttt acg ttc ttt 288 
Pro Val Gin Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe 
85 90 95 

aca ccg gac atg acg cac aaa att gat gcg aaa atg aca cgc gaa gac 336 
Thr Pro Asp Met Thr His Lys He Asp Ala Lys Met Thr Arg Glu Asp 
100 105 110 

ttt gaa aaa gag ate aac age atg ctg cca age aga aac tta ttt tat 384 
Phe Glu Lys Glu He Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr 
115 120 125 

gca att cgc att gac gga ttg ttt aaa aag gtg cag aca aga aca gta 432 
Ala Ile Arg Ile Asp Gly Leu Phe Lys Lys Val Gln Thr Arg Thr Val
130 135 140

gaa ctt caa gaa aaa cct tac gtg cca atg gtt gaa gcg gtc aaa aca 480

Glu Leu Gln Glu Lys Pro Tyr Val Pro Met Val Glu Ala Val Lys Thr
145 150 155 160

cag ccg att ttc aac ttc gac aac gtg aga gga acg att gta ggt ttc 528 

Gin Pro lie Phe Asn Phe Asp Asn Val Arg Gly Thr He Val Gly Phe 
165 170 175 

ttg aca cca get tat gca aac gga ate gec gtt tct ggc tat cac ctg 576 

Leu Thr Pro Ala Tyr Ala Asn Gly He Ala Val Ser Gly Tyr His Leu 

180 185 190 

cac ttc att gac gaa gga cgc aat tea ggc gga cac gtt ttt gac tat 624 

His Phe He Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr 
195 200 205 

gtg ctt gag gat tgc acg gtt acg att tct caa aaa atg aac atg aat 672 

Val Leu Glu Asp Cys Thr Val Thr He Ser Gin Lys Met Asn Met Asn 
210 215 220 

ctc aga ctt ccg aac aca gcg gat ttc ttt aat gcg aat ctg gat aac 720

Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn
225 230 235 240

cct gat ttt gcg aaa gat ate gaa aca act gaa gga age cct gaa taa 768 

Pro Asp Phe Ala Lys Asp He Glu Thr Thr Glu Gly Ser Pro Glu 
245 250 255 



<210> 69 
<211> 255 
<212> PRT 

<213> Bacillus subtilis 
<400> 69 

Met Lys Arg Glu Ser Asn He Gin Val Leu Ser Arg Gly Gin Lys Asp 
1 5 10 15 

Gin Pro Val Ser Gin He Tyr Gin Val Ser Thr Met Thr Ser Leu Leu 
20 25 30 

Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu He Pro Lys Tyr 
35 40 45 

Gly Asp Phe Gly He Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu He 
50 55 60 

Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr 
65 70 75 80 

Pro Val Gln Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe
85 90 95

Thr Pro Asp Met Thr His Lys He Asp Ala Lys Met Thr Arg Glu Asp 
100 105 110 

Phe Glu Lys Glu Ile Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr
115 120 125

Ala Ile Arg Ile Asp Gly Leu Phe Lys Lys Val Gln Thr Arg Thr Val
130 135 140

Glu Leu Gln Glu Lys Pro Tyr Val Pro Met Val Glu Ala Val Lys Thr
145 150 155 160

Gln Pro Ile Phe Asn Phe Asp Asn Val Arg Gly Thr Ile Val Gly Phe
165 170 175

Leu Thr Pro Ala Tyr Ala Asn Gly Ile Ala Val Ser Gly Tyr His Leu
180 185 190

His Phe Ile Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr
195 200 205

Val Leu Glu Asp Cys Thr Val Thr Ile Ser Gln Lys Met Asn Met Asn
210 215 220

Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn
225 230 235 240

Pro Asp Phe Ala Lys Asp Ile Glu Thr Thr Glu Gly Ser Pro Glu
245 250 255



<210> 70 

<211> 1254 

<212> DNA 

<213> Escherichia coli

<220> 

<221> CDS 

<222> (1)..(1251)

<400> 70 

atg aca ttc tcc ctt ttt ggt gac aaa ttt acc cgc cac tcc ggc att 48

Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly Ile

1 5 10 15

acg ctg ttg atg gaa gat ctg aac gac ggt tta cgc acg cct ggc gcg 96 
Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala 
20 25 30 

att atg ctc ggc ggc ggt aat ccg gcg cag atc ccg gaa atg cag gac 144
Ile Met Leu Gly Gly Gly Asn Pro Ala Gln Ile Pro Glu Met Gln Asp
35 40 45 

tac ttc cag acg eta ctg acc gac atg ctg gaa agt ggc aaa gcg act 192 
Tyr Phe Gin Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr 
50 55 60 

gat gca ctg tgt aac tac gac ggt cca cag ggg aaa acg gag eta etc 240 
Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gin Gly Lys Thr Glu Leu Leu 
65 70 75 80 



aca ctg ctt gcc gga atg ctg cgc gag aag ttg ggt tgg gat atc gaa 288
Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp Ile Glu
85 90 95

cca cag aat att gca eta aca aac ggc age cag age gcg ttt ttc tac 336 
Pro Gin Asn lie Ala Leu Thr Asn Gly Ser Gin Ser Ala Phe Phe Tyr 
100 105 110 

tta ttt aac ctg ttt gcc gga cgc cgt gcc gat ggt cgg gtc aaa aaa 384
Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys
115 120 125

gtg ctg ttc ccg ctt gca ccg gaa tac att ggc tat get gac gee gga 432 
Val Leu Phe Pro Leu Ala Pro Glu Tyr lie Gly Tyr Ala Asp Ala Gly 
130 135 140 

ctg gaa gaa gat ctg ttt gtc tct gcg cgt ccg aat att gaa ctg ctg 480 
Leu Glu Glu Asp Leu Phe Val Ser Ala Arg Pro Asn lie Glu Leu Leu 
145 150 155 160 

ccg gaa ggc cag ttt aaa tac cac gtc gat ttt gag cat ctg cat att 528 
Pro Glu Gly Gin Phe Lys Tyr His Val Asp Phe Glu His Leu His lie 
165 170 175 

ggc gaa gaa acc ggg atg att tgc gtc tec egg ccg acg aat cca aca 576 
Gly Glu Glu Thr Gly Met lie Cys Val Ser Arg Pro Thr Asn Pro Thr 
180 185 190 

ggc aat gtg att act gac gaa gag ttg ctg aag ctt gac gcg ctg ggc 624 
Gly Asn Val lie Thr Asp Glu Glu Leu Leu Lys Leu Asp Ala Leu Gly 
195 200 205 

aat caa cac ggc att ccg ctg gtg att gat aac get tat ggc gtc ccg 672 
Asn Gin His Gly lie Pro Leu Val He Asp Asn Ala Tyr Gly Val Pro 
210 215 220 

ttc ccg ggt ate ate ttc agt gaa gcg cgc ccg eta tgg aat ccg aat 720 
Phe Pro Gly He He Phe Ser Glu Ala Arg Pro Leu Trp Asn Pro Asn 
225 230 235 240 

atc gtg ctg tgc atg agt ctt tcc aag ctg ggt cta cct ggc tcc cgc 768
Ile Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg
245 250 255 

tgc ggc att ate ate gee aat gaa aaa ate ate acc gee ate acc aat 816 
Cys Gly He He He Ala Asn Glu Lys He He Thr Ala He Thr Asn 
260 265 270 

atg aac ggc att ate age ctg gca cct ggc ggt att ggt ccg gcg atg 864 
Met Asn Gly He He Ser Leu Ala Pro Gly Gly He Gly Pro Ala Met 
275 280 285 

atg tgt gaa atg att aag cgt aac gat ctg ctg cgc ctg tct gaa aca 912 
Met Cys Glu Met lie Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr 
290 295 300 

gtc ate aaa ccg ttt tac tac cag cgt gtt cag gaa act ate gee ate 960 
Val He Lys Pro Phe Tyr Tyr Gin Arg Val Gin Glu Thr lie Ala He 
305 310 315 320 




att cgc cgc tat tta ccg gaa aat cgc tgc ctg att cat aaa ccg gaa 1008 

lie Arg Arg Tyr Leu Pro Glu Asn Arg Cys Leu lie His Lys Pro Glu 
325 330 335 

gga gcc att ttc etc tgg eta tgg ttt aag gat ttg ccc att acg acc 1056 

Gly Ala lie Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro lie Thr Thr 
340 345 350 

aag cag etc tat cag cgc ctg aaa gca cgc ggc gtg ctg atg gtg ccg 1104 

Lys Gin Leu Tyr Gin Arg Leu Lys Ala Arg Gly Val Leu Met Val Pro 

355 360 365 

ggg cac aac ttc ttc cca ggg ctg gat aaa ccg tgg ccg cat acg cat 1152 

Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His 
370 375 380 

caa tgt atg cgc atg aac tac gta cca gag ccg gag aaa att gag gcg 1200

Gln Cys Met Arg Met Asn Tyr Val Pro Glu Pro Glu Lys Ile Glu Ala
385 390 395 400 

ggg gtg aag att ctg gcg gaa gag ata gaa aga gcc tgg gct gaa agt 1248
Gly Val Lys Ile Leu Ala Glu Glu Ile Glu Arg Ala Trp Ala Glu Ser
405 410 415

cac taa 1254
His



<210> 71 
<211> 417 
<212> PRT 

<213> Escherichia coli 
<400> 71 

Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly Ile
1 5 10 15

Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala
20 25 30

Ile Met Leu Gly Gly Gly Asn Pro Ala Gln Ile Pro Glu Met Gln Asp
35 40 45

Tyr Phe Gln Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr
50 55 60

Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gln Gly Lys Thr Glu Leu Leu
65 70 75 80

Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp Ile Glu
85 90 95

Pro Gln Asn Ile Ala Leu Thr Asn Gly Ser Gln Ser Ala Phe Phe Tyr
100 105 110

Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys
115 120 125

Val Leu Phe Pro Leu Ala Pro Glu Tyr Ile Gly Tyr Ala Asp Ala Gly
130 135 140

Leu Glu Glu Asp Leu Phe Val Ser Ala Arg Pro Asn Ile Glu Leu Leu
145 150 155 160

Pro Glu Gly Gln Phe Lys Tyr His Val Asp Phe Glu His Leu His Ile
165 170 175

Gly Glu Glu Thr Gly Met Ile Cys Val Ser Arg Pro Thr Asn Pro Thr
180 185 190

Gly Asn Val Ile Thr Asp Glu Glu Leu Leu Lys Leu Asp Ala Leu Gly
195 200 205

Asn Gln His Gly Ile Pro Leu Val Ile Asp Asn Ala Tyr Gly Val Pro
210 215 220

Phe Pro Gly Ile Ile Phe Ser Glu Ala Arg Pro Leu Trp Asn Pro Asn
225 230 235 240

Ile Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg
245 250 255

Cys Gly Ile Ile Ile Ala Asn Glu Lys Ile Ile Thr Ala Ile Thr Asn
260 265 270

Met Asn Gly Ile Ile Ser Leu Ala Pro Gly Gly Ile Gly Pro Ala Met
275 280 285

Met Cys Glu Met Ile Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr
290 295 300

Val Ile Lys Pro Phe Tyr Tyr Gln Arg Val Gln Glu Thr Ile Ala Ile
305 310 315 320

Ile Arg Arg Tyr Leu Pro Glu Asn Arg Cys Leu Ile His Lys Pro Glu
325 330 335

Gly Ala Ile Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro Ile Thr Thr
340 345 350

Lys Gln Leu Tyr Gln Arg Leu Lys Ala Arg Gly Val Leu Met Val Pro
355 360 365

Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His
370 375 380

Gln Cys Met Arg Met Asn Tyr Val Pro Glu Pro Glu Lys Ile Glu Ala
385 390 395 400

Gly Val Lys Ile Leu Ala Glu Glu Ile Glu Arg Ala Trp Ala Glu Ser
405 410 415

His

<210> 72
<211> 8803 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN294 plasmid 

<400> 72 



tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 240
gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 300
tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggctgaag aagccatcac 360
gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420
catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480
gcatacttat gcgcggctga aggagcccat gtcctgctgt cggctagacg cgaggatcgt 540
ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct 600
ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660
gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacattggat 720
gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780
cttccgcaaa tgcttgagca aaaaaaggga catatcatca atatcgcctc tcaagcgggg 840
aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatgccgt gttaggttac 900
tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960
ggcccgattc agacggactt tttttccatt gctgataaag gcggggacta cgccaaaaat 1020
gtcggccgct ggatgcttga tcctgatgac gtggcagctc aaattacagc tgcaattttt 1080
acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa gctgtatcag 1140
ctgtttccag ctcttgtaga aaagctggca ggacgcgcgc tcatgaaaaa ataatgatag 1200
aactgcctgt ggtggagtgg cttgtttctc acggggcagt ttttgatagt ggaagggaga 1260
gattgttgaa tgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320
cagaaactga ttgatgcaga gcctgtattc tgggcaaatt caggcaagaa agaggggcca 1380
ttaccccgtg cagatgagtg ggcaaccgag atagcggaag cggaaaaaag aatgcagcgg 1440
tttgcacctt acattgccga ggtgtttcct gagacgaaag gcgctaaagg aatcatcgag 1500
tctccgcttt ttgaggtgca gcatatgaag ggaaagctgg aagcggcata tcagcagcca 1560
tttcccggaa gatggctttt aaagtgcgac catgagcttc cgatttcagg atcgattaaa 1620
gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct tcaagaagga 1680
atgcttcagg aaaccgatga ttaccgcatc ttacaggaag agcggtttac cgggtttttc 1740
tcccgctatt cgattgctgt cggttcgaca ggaaatctag gtttaagcat cggcatcatc 1800
ggcgcggcac tcgggtttcg cgtgacagtg catatgtccg ccgatgctaa gcagtggaaa 1860
aaggatctcc tccgccaaaa gggagtcact gttatggagt acgaaacaga ttacagtgaa 1920
gcggtgaacg aagggagacg gcaggcggaa caagatccat tctgttattt tattgatgat 1980
gaacattctc gtcagctgtt cttaggatat gctgttgctg caagccgatt aaaaacacag 2040
cttgactgta tgaatataaa gccaagtctt gagacgccct tgtttgtgta tctgccgtgc 2100
ggagtcggcg gaggaccggg cggtgtagca tttgggctga agcttttata cggagatgat 2160
gttcatgtgt ttttcgcaga accaactcat tcaccttgta tgctgttagg gctttattca 2220
ggacttcacg agaagatctc cgtccaggat atcggcctgg ataatcagac ggctgctgac 2280
ggacttgccg tagggaggcc gtcaggattt gtcggcaagc tgattgaacc gcttctgagc 2340
ggctgttata cggtagagga caatacgctt tatactttgc ttcatatgct ggctgtatct 2400
gaagataaat atttagagcc ctctgctctt gctggcatgt tcgggccggt tcagcttttt 2460
tcgacagaag agggaaggcg ctatgctcag aaatataaga tggaacatgc cgtacatgtc 2520
gtctggggaa cgggaggaag catggttcca aaagatgaaa tggctgcgta taaccgaatc 2580
ggtgctgatt tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttttccgt 2640
tacaatcttt gtaagggttt taacctacag agagtcaggt gtaaacagtg aaaaataaag 2700
aacttaacct acatacttta tatacacagc acaatcggga gtcttggtct ggttttgggg 2760
ggcatttgtc gattgctgta tctgaagaag aggcaaaagc tgtggaagga ttgaatgatt 2820
atctatctgt tgaagaagtg gagacgatct atattccgct tgttcgcttg cttcatttac 2880
atgtcaagtc tgcggctgaa cgcaataagc atgtcaatgt ttttttgaag cacccacatt 2940
cagccaaaat tccgtttatt atcggcattg ccggcagtgt cgcagtcgga aaaagcacga 3000
cggcgcggat cttgcagaag ctgctttcgc gtttgcctga ccgtccaaaa gtgagcctta 3060
tcacgacaga tggtttttta tttcctactg ccgagctgaa aaagaaaaat atgatgtcaa 3120
gaaaaggatt tcctgaaagc tatgatgtaa aggcgctgct cgaatttttg aatgacttaa 3180
aatcaggaaa ggacagcgta aaggccccgg tgtattccca tctaacctat gaccgcgagg 3240
aaggtgtgtt cgaggttgta gaacaggcgg atattgtgat tattgaaggc attaatgttc 3300
ttcagtcgcc caccttggag gatgaccggg aaaacccgcg tatttttgtt tccgatttct 3360
ttgatttttc gatttatgtg gatgcggagg aaagccggat tttcacttgg tatttagagc 3420
gttttcgcct gcttcgggaa acagcttttc aaaatcctga ttcatatttt cataaattta 3480
aagacttgtc cgatcaggag gctgacgaga tggcagcctc gatttgggag agtgtcaacc 3540
ggccgaattt atatgaaaat attttgccaa ctaaattcag gtcagatctc attttgcgta 3600
agggagacgg gcataaggtc gaggaagtgt tggtaaggag ggtatgaaat gtgctgcagc 3660
tcgagcaata gttaccctta ttatcaagat aagaaagaaa aggatttttc gctacgctca 3720
aatcctttaa aaaaacacaa aagaccacat tttttaatgt ggtctttatt cttcaactaa 3780
agcacccatt agttcaacaa acgaaaattg gataaagtgg gatattttta aaatatatat 3840
ttatgttaca gtaatattga cttttaaaaa aggattgatt ctaatgaaga aagcagacaa 3900
gtaagcctcc taaattcact ttagataaaa atttaggagg catatcaaat gaactttaat 3960
aaaattgatt tagacaattg gaagagaaaa gagatattta atcattattt gaaccaacaa 4020
acgactttta gtataaccac agaaattgat attagtgttt tataccgaaa cataaaacaa 4080
gaaggatata aattttaccc tgcatttatt ttcttagtga caagggtgat aaactcaaat 4140
acagctttta gaactggtta caatagcgac ggagagttag gttattggga taagttagag 4200
ccactttata caatttttga tggtgtatct aaaacattct ctggtatttg gactcctgta 4260
aagaatgact tcaaagagtt ttatgattta tacctttctg atgtagagaa atataatggt 4320
tcggggaaat tgtttcccaa aacacctata cctgaaaatg ctttttctct ttctattatt 4380
ccatggactt catttactgg gtttaactta aatatcaata ataatagtaa ttaccttcta 4440
cccattatta cagcaggaaa attcattaat aaaggtaatt caatatattt accgctatct 4500
ttacaggtac atcattctgt ttgtgatggt tatcatgcag gattgtttat gaactctatt 4560
caggaattgt cagataggcc taatgactgg cttttataat atgagataat gccgactgta 4620
ctttttacag tcggttttct aatgtcacta acctgccccg ttagttgaag aaggttttta 4680
tattacagct gtcgactcgt gatcttcgga caggctgttc agctttttct caatgcgatc 4740
cagctgcgct tttcggtttt tcgcatactt gaagcctgta acagccgcaa agacgacagc 4800
ggcaaatata ataaatacaa acagctgaaa catcacatca cctatattca tgttcttcac 4860
ctcatgtttg cgggagagat tcattctctt ccgtttttta tttaaagcgg cttttccaga 4920
cgggaacggt gttttgtggt ctccattttc atttgccgat aggcgaacgc taaaaatggc 4980
aggccgagca gggtaatgcc gctcaggaca gaaaaaatat aaatcggccg gccagcgcca 5040
aacaggtcta tacatatccc cccgacccaa gggccgatga cgtttccgag ctgtggaaaa 5100
ccgattgccc cgaaataagt gccttttaat cctggttttg caatctggtc tacatacaaa 5160
tccatcatag agaataaaag cacttcgccg attgtaaatg tgatgacaat catcacaatt 5220
gatggaacac cgtgtgatac ggtgaaaatg gccatgctga tgctaaccat cacattaccg 5280
agcatcagag aacaaagcgg cgaaaaccgt tttgcaaaat ggacaatggg aaattgcgtc 5340
gccaacacaa cgattgcgtt taatgtcagc atcagcccat acagcttcgt tccattgccg 5400
atcaaggggt tctgcgccat atactgaggg aatgtggaac tgaattgtga gtagccgaag 5460
gtgcatagcg taatgccgac caaagcaatg gtaaaaagat aatccttttg cgtgaccata 5520
aacgcttccc gcacgctcat atttcgggac tgggctggtg ctgataagga tggatgtttt 5580
ttaaattgga gggcaagcac aattccgtat agtccgtaaa tgactgcagg caccaaaaag 5640
ggcgtagtcg attgcgatga gccgaaatat aggccaagca caggtccgaa gacaacgccg 5700
atattaatag ccgcatagcg taaattaaaa actagcagtc tcgttttttc ttctgtcata 5760
tcagacaaca aggcctttga agcgggctca aacagtgatt tgcaaagacc gtttaatgcg 5820
tttactacaa aaaacaccca gagattagat gctgccgcaa agcctgcaaa taccagcatc 5880
catccgaaaa tcgatacaag catcatgttt tttctgccga atttatctga gatatatccg 5940
ccgtaaaagc ttgcgaggat gccgactgat gagctcgcgg cgatgaccag ccctgcatag 6000
gaagctgatg cgccttggac ggctgtcaaa taaatcgcta aaaaaggaat gctcatcgat 6060
gttgccattc tgccgaaaat ggttccgatt ataattgtac gcgttggatg catagcttga 6120
gtattctata gtgtcaccta aatagcttgg cgtaatcatg gtcatagctg tttcctgtgt 6180
gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag 6240
cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt 6300
tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 6360
gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 6420
ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 6480
caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 6540
aaaaggccgc gttgctggcg tttttcgata ggctccgccc ccctgacgag catcacaaaa 6600
atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 6660
cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 6720
ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca 6780
gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 6840
accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat 6900
cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 6960
cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta tttggtatct 7020
gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac 7080
aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa 7140
aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 7200
actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 7260
taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 7320
gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 7380
tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 7440
ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa 7500
accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 7560
agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 7620
acgttgttgg cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 7680
tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 7740
cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 7800
tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 7860
ctgtgactgg tgagtactca accaagtcat tctgagaata ccgcgcccgg cgaccgagtt 7920
gctcttgccc ggcgtcaata cgggataata gtgtatgaca tagcagaact ttaaaagtgc 7980
tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 8040
ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 8100
gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 8160
cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg 8220
gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg 8280
ttccgcgcac atttccccga aaagtgccac ctgtatgcgg tgtgaaatac cgcacagatg 8340
cgtaaggaga aaataccgca tcaggcgaaa ttgtaaacgt taatattttg ttaaaattcg 8400
cgttaaatat ttgttaaatc agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc 8460
cttataaatc aaaagaatag accgagatag ggttgagtgt tgttccagtt tggaacaaga 8520
gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg 8580
atggcccact acgtgaacca tcacccaaat caagtttttt gcggtcgagg tgccgtaaag 8640
ctctaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga aagccggcga 8700
acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg ctggcaagtg 8760
tagcggtcac gctgcgcgta accaccacac ccgccgcgct taa 8803



<210> 73 
<211> 8320 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN296 plasmid

<400> 73 



tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 240
gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 300
tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggctgaag aagccatcac 360
gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420
catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480
gcatacttat gcgcggctga aggagcccat gtcctgctgt cggctagacg cgaggatcgt 540
ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct 600
ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660
gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacattggat 720
gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780
cttccgcaaa tgcttgagca aaaaaaggga catatcatca atatcgcctc tcaagcgggg 840
aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatgccgt gttaggttac 900
tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960
ggcccgattc agacggactt tttttccatt gctgataaag gcggggacta cgccaaaaat 1020
gtcggccgct ggatgcttga tcctgatgac gtggcagctc aaattacagc tgcaattttt 1080
acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa gctgtatcag 1140
ctgtttccag ctcttgtaga aaagctggca ggacgcgcgc tcatgaaaaa ataatgatag 1200
aactgcctgt ggtggagtgg cttgtttctc acggggcagt ttttgatagt ggaagggaga 1260
gattgttgaa tgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320
cagaaactga ttgatgcaga gcctgtattc tgggcaaatt caggcaagaa agaggggcca 1380
ttaccccgtg cagatgagtg ggcaaccgag atagcggaag cggaaaaaag aatgcagcgg 1440
tttgcacctt acattgccga ggtgtttcct gagacgaaag gcgctaaagg aatcatcgag 1500
tctccgcttt ttgaggtgca gcatatgaag ggaaagctgg aagcggcata tcagcagcca 1560
tttcccggaa gatggctttt aaagtgcgac catgagcttc cgatttcagg atcgattaaa 1620
gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct tcaagaagga 1680
atgcttcagg aaaccgatga ttaccgcatc ttacaggaag agcggtttac cgggtttttc 1740
tcccgctatt cgattgctgt cggttcgaca ggaaatctag gtttaagcat cggcatcatc 1800
ggcgcggcac tcgggtttcg cgtgacagtg catatgtccg ccgatgctaa gcagtggaaa 1860
aaggatctcc tccgccaaaa gggagtcact gttatggagt acgaaacaga ttacagtgaa 1920
gcggtgaacg aagggagacg gcaggcggaa caagatccat tctgttattt tattgatgat 1980
gaacattctc gtcagctgtt cttaggatat gctgttgctg caagccgatt aaaaacacag 2040
cttgactgta tgaatataaa gccaagtctt gagacgccct tgtttgtgta tctgccgtgc 2100
ggagtcggcg gaggaccggg cggtgtagca tttgggctga agcttttata cggagatgat 2160
gttcatgtgt ttttcgcaga accaactcat tcaccttgta tgctgttagg gctttattca 2220
ggacttcacg agaagatctc cgtccaggat atcggcctgg ataatcagac ggctgctgac 2280
ggacttgccg tagggaggcc gtcaggattt gtcggcaagc tgattgaacc gcttctgagc 2340
ggctgttata cggtagagga caatacgctt tatactttgc ttcatatgct ggctgtatct 2400
gaagataaat atttagagcc ctctgctctt gctggcatgt tcgggccggt tcagcttttt 2460
tcgacagaag agggaaggcg ctatgctcag aaatataaga tggaacatgc cgtacatgtc 2520
gtctggggaa cgggaggaag catggttcca aaagatgaaa tggctgcgta taaccgaatc 2580
ggtgctgatt tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttttccgt 2640
tacaatcttt gtaagggttt taacctacag agagtcaggt gtaaacagtg aaaaataaag 2700
aacttaacct acatacttta tatacacagc acaatcggga gtcttctgca gctcgagcaa 2760
tagttaccct tattatcaag ataagaaaga aaaggatttt tcgctacgct caaatccttt 2820
aaaaaaacac aaaagaccac attttttaat gtggtcttta ttcttcaact aaagcaccca 2880
ttagttcaac aaacgaaaat tggataaagt gggatatttt taaaatatat atttatgtta 2940
cagtaatatt gacttttaaa aaaggattga ttctaatgaa gaaagcagac aagtaagcct 3000
cctaaattca ctttagataa aaatttagga ggcatatcaa atgaacttta ataaaattga 3060
tttagacaat tggaagagaa aagagatatt taatcattat ttgaaccaac aaacgacttt 3120
tagtataacc acagaaattg atattagtgt tttataccga aacataaaac aagaaggata 3180
taaattttac cctgcattta ttttcttagt gacaagggtg ataaactcaa atacagcttt 3240
tagaactggt tacaatagcg acggagagtt aggttattgg gataagttag agccacttta 3300
tacaattttt gatggtgtat ctaaaacatt ctctggtatt tggactcctg taaagaatga 3360
cttcaaagag ttttatgatt tatacctttc tgatgtagag aaatataatg gttcggggaa 3420
attgtttccc aaaacaccta tacctgaaaa tgctttttct ctttctatta ttccatggac 3480
ttcatttact gggtttaact taaatatcaa taataatagt aattaccttc tacccattat 3540
tacagcagga aaattcatta ataaaggtaa ttcaatatat ttaccgctat ctttacaggt 3600
acatcattct gtttgtgatg gttatcatgc aggattgttt atgaactcta ttcaggaatt 3660
gtcagatagg cctaatgact ggcttttata atatgagata atgccgactg tactttttac 3720
agtcggtttt ctaatgtcac taacctgccc cgttagttga agaaggtttt tatattacag 3780
ctgtcgacta aggtcgagga agtgttggta aggagggtat gaaatgtgca tcatattgaa 3840
ctgtatgtct ctgatttgga ggcgtctagg cggttttggg gctggttctt aaaagaactt 3900
ggttataaag agtatcaaaa atggagctca ggcatcagct ggaagaaaga tcgtttttac 3960
ctagtgattg tgcaggcgaa agagccattt ctagagccgg aataccatag atgccgagtc 4020
ggtctgaacc atctcgcatt tcatgctgaa tccaagcttc aagtcgatca gatgactgaa 4080
aaattgacgg caaaaggcta tcgtgtgttg taccgagaca ggcatccttt tgccggagga 4140
gacgggcatt atgcagtctt ttgtgaggat ccagaccgga ttaaggtaga gctcgttgcc 4200
ccaagctgtt aatcgtgatc ttcggacagg ctgttcagct ttttctcaat gcgatccagc 4260
tgcgcttttc ggtttttcgc atacttgaag cctgtaacag ccgcaaagac gacagcggca 4320
aatataataa atacaaacag ctgaaacatc acatcaccta tattcatgtt cttcacctca 4380
tgtttgcggg agagattcat tctcttccgt tttttattta aagcggcttt tccagacggg 4440
aacggtgttt tgtggtctcc attttcattt gccgataggc gaacgctaaa aatggcaggc 4500
cgagcagggt aatgccgctc aggacagaaa aaatataaat cggccggcca gcgccaaaca 4560
ggtctataca tatccccccg acccaagggc cgatgacgtt tccgagctgt ggaaaaccga 4620
ttgccccgaa ataagtgcct tttaatcctg gttttgcaat ctggtctaca tacaaatcca 4680
tcatagagaa taaaagcact tcgccgattg taaatgtgat gacaatcatc acaattgatg 4740
gaacaccgtg tgatacggtg aaaatggcca tgctgatgct aaccatcaca ttaccgagca 4800
tcagagaaca aagcggcgaa aaccgttttg caaaatggac aatgggaaat tgcgtcgcca 4860
acacaacgat tgcgtttaat gtcagcatca gcccatacag cttcgttcca ttgccgatca 4920
aggggttctg cgccatatac tgagggaatg tggaactgaa ttgtgagtag ccgaaggtgc 4980
atagcgtaat gccgaccaaa gcaatggtaa aaagataatc cttttgcgtg accataaacg 5040
cttcccgcac gctcatattt cgggactggg ctggtgctga taaggatgga tgttttttaa 5100
attggagggc aagcacaatt ccgtatagtc cgtaaatgac tgcaggcacc aaaaagggcg 5160
tagtcgattg cgatgagccg aaatataggc caagcacagg tccgaagaca acgccgatat 5220
taatagccgc atagcgtaaa ttaaaaacta gcagtctcgt tttttcttct gtcatatcag 5280
acaacaaggc ctttgaagcg ggctcaaaca gtgatttgca aagaccgttt aatgcgttta 5340
ctacaaaaaa cacccagaga ttagatgctg ccgcaaagcc tgcaaatacc agcatccatc 5400
cgaaaatcga tacaagcatc atgttttttc tgccgaattt atctgagata tatccgccgt 5460
aaaagcttgc gaggatgccg actgatgagc tcgcggcgat gaccagccct gcataggaag 5520
ctgatgcgcc ttggacggct gtcaaataaa tcgctaaaaa aggaatgctc atcgatgttg 5580
ccattctgcc gaaaatggtt ccgattataa ttgtaacgcg ttggatgcat agcttgagta 5640
ttctatagtg tcacctaaat agcttggcgt aatcatggtc atagctgttt cctgtgtgaa 5700
attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct 5760
ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc 5820
agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 5880
gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 5940
ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 6000
gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 6060
aggccgcgtt gctggcgttt ttcgataggc tccgcccccc tgacgagcat cacaaaaatc 6120
gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 6180
ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 6240
cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt 6300
cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 6360
gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 6420
cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag 6480
agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg 6540
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 6600
ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 6660
gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact 6720
cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa 6780
attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 6840
accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag 6900
ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca 6960
gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc 7020
agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt 7080
ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 7140
ttgttggcat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca 7200
gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg 7260
ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca 7320
tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg 7380
tgactggtga gtactcaacc aagtcattct gagaataccg cgcccggcga ccgagttgct 7440
cttgcccggc gtcaatacgg gataatagtg tatgacatag cagaacttta aaagtgctca 7500
tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca 7560
gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg 7620
tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 7680
ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt 7740
attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc 7800
cgcgcacatt tccccgaaaa gtgccacctg tatgcggtgt gaaataccgc acagatgcgt 7860
aaggagaaaa taccgcatca ggcgaaattg taaacgttaa tattttgtta aaattcgcgt 7920
taaatatttg ttaaatcagc tcatttttta accaataggc cgaaatcggc aaaatccctt 7980
ataaatcaaa agaatagacc gagatagggt tgagtgttgt tccagtttgg aacaagagtc 8040
cactattaaa gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg 8100
gcccactacg tgaaccatca cccaaatcaa gttttttgcg gtcgaggtgc cgtaaagctc 8160
taaatcggaa ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg 8220
tggcgagaaa ggaagggaag aaagcgaaag gagcgggcgc tagggcgctg gcaagtgtag 8280
cggtcacgct gcgcgtaacc accacacccg ccgcgcttaa 8320



<210> 74 
<211> 250 
<212> PRT 

<213> Clostridium acetobutylicum 
<400> 74 

Asn Lys Arg Ala Ala Phe Met Leu Leu Leu Phe Leu Arg Ser Val Leu
1 5 10 15

Lys Val Ile Leu Val Leu Asp Val Gly Asn Thr Asn Ile Val Leu Gly
20 25 30

Ile Tyr Asn Asp Thr Lys Leu Thr Ala Glu Trp Arg Leu Ser Thr Asp
35 40 45

Val Leu Arg Ser Ala Asp Glu Tyr Gly Ile Gln Val Met Asn Leu Phe
50 55 60

Gln Gln Asp Lys Leu Asp Pro Thr Leu Val Glu Gly Val Ile Ile Ser
65 70 75 80

Ser Val Val Pro Asn Ile Met Tyr Ser Leu Glu His Met Ile Arg Lys
85 90 95

Tyr Phe Lys Ile Asn Pro Leu Val Val Gly Pro Gly Ile Lys Thr Gly
100 105 110

Ile Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile
115 120 125

Val Asn Ala Val Ala Ala His Glu Ile Tyr Lys Arg Ser Leu Ile Ile
130 135 140

Ile Asp Phe Gly Thr Ala Thr Thr Phe Cys Ala Val Arg Glu Asn Gly
145 150 155 160

Asp Tyr Leu Gly Gly Ala Ile Cys Pro Gly Ile Lys Val Ser Ser Glu
165 170 175

Ala Leu Phe Glu Lys Ala Ala Lys Leu Pro Arg Val Glu Leu Ile Lys
180 185 190

Pro Ala Tyr Ala Ile Cys Lys Asn Thr Ile Ser Ser Ile Gln Ser Gly
195 200 205

Ile Val Tyr Arg Tyr Leu Arg Gln Val Lys Tyr Leu Phe Glu Lys Leu
210 215 220

Lys Glu Asn Leu Pro Asp Gly Arg Arg Thr Arg Thr Ser Leu Val Leu
225 230 235 240

Ala Thr Gly Gly Leu Ala Lys Leu Ile Asn
245 250


<210> 75
<211> 258 
<212> PRT 

<213> Rhodobacter capsulatus 
<400> 75 

Met Leu Leu Cys Ile Asp Cys Gly Asn Thr Asn Thr Val Phe Ser Val
1 5 10 15

Trp Asp Gly Thr Asp Phe Ala Ala Thr Trp Arg Ile Ala Thr Asp His
20 25 30

Arg Arg Thr Ala Asp Glu Tyr Phe Val Trp Leu Asn Thr Leu Met Gln
35 40 45

Leu Lys Gly Leu Gln Gly Arg Ile Ser Glu Ala Ile Ile Ser Ser Thr
50 55 60

Ala Pro Arg Val Val Phe Asn Leu Arg Val Leu Cys Asn Arg Tyr Phe
65 70 75 80

Asp Cys Arg Pro Tyr Val Val Gly Lys Pro Gly Cys Glu Leu Pro Val
85 90 95

Ala Pro Arg Val Asp Pro Gly Thr Thr Val Gly Pro Asp Arg Leu Val
100 105 110

Asn Thr Val Ala Gly Tyr Asp Arg His Gly Gly Asp Leu Ile Val Val
115 120 125

Asp Phe Gly Thr Ala Thr Thr Phe Asp Val Val Ala Pro Asp Gly Ala
130 135 140

Tyr Ile Gly Gly Val Ile Ala Pro Gly Val Asn Leu Ser Leu Glu Ala
145 150 155 160

Leu His Met Ala Ala Ala Ala Leu Pro His Val Asp Val Thr Lys Pro
165 170 175

Gln Gly Val Ile Gly Thr Asn Thr Val Ala Cys Ile Gln Ser Gly Val
180 185 190

Tyr Trp Gly Tyr Ile Gly Leu Val Glu Gly Ile Val Arg Gln Ile Arg
195 200 205

Met Glu Arg Asp Arg Pro Met Lys Val Ile Ala Thr Gly Gly Leu Ala
210 215 220

Ser Leu Phe Asp Leu Gly Phe Asp Leu Phe Asp Lys Val Glu Asp Asp
225 230 235 240

Leu Thr Met His Gly Leu Arg Leu Ile Phe Asp Tyr Asn Lys Gly Leu
245 250 255

Gly Ala

<210> 76 
<211> 10801 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN240 plasmid 



<400> 76 
gaattttgcg 


gccgcttcga 


aagctgtaat 


ataaaaacct 


tcttcaacta 


acggggcagg 


60 


ttagtgacat 


tagaaaaccg 


actgtaaaaa 


gtacagtcgg 


cattatctca 


tattataaaa 


120 


gccagtcatt 


aggcctatct 


gacaattcct 


gaatagagtt 


cataaacaat 


cctgcatgat 


180 


aaccatcaca 


aacagaatga 


tgtacctgta 


aagatagcgg 


taaatatatt 


gaattacctt 


240 


tattaatgaa 


ttttcctgct 


gtaataatgg 


gtagaaggta 


attactatta 


ttattgatat 


300 


ttaagttaaa 


cccagtaaat 


gaagtccatg 


gaataataga 


aagagaaaaa 


gcattttcag 


360 


gtataggtgt 


tttgggaaac 


aatttccccg 


aaccattata 


tttctctaca 


tcagaaaggt 


420 


ataaatcata 


aaactctttg 


aagtcattct 


ttacaggagt 


ccaaatacca 


gagaatgttt 


480 


tagatacacc 


atcaaaaatt 


gtataaagtg 


gctctaactt 


atcccaataa 


cctaactctc 


540 


cgtcgctatt 


gtaaccagtt 


ctaaaagctg 


tatttgagtt 


tatcaccctt 


gtcactaaga 


600 


aaataaatgc 


agggtaaaat 


ttatatcctt 


cttgttttat 


gtttcggtat 


aaaacactaa 


660 


tatcaatttc 


tgtggttata 


ctaaaagtcg 


tttgttggtt 


caaataatga 


ttaaatatct 


720 


cttttctctt 


ccaattgtct 


aaatcaattt 


tattaaagtt 


catttgatat 


gcctcctaaa 


780 


tttttatcta 


aagtgaattt 


aggaggctta 


cttgtctgct 


ttcttcatta 


gaatcaatcc 


840 


ttttttaaaa 


gtcaatatta 


ctgtaacata 


aatatatatt 


ttaaaaatat 


cccactttat 


900 


ccaattttcg 


tttgttgaac 


taatgggtgc 


tttagttgaa 


gaataaagac 


cacattaaaa 


960 


aatgtggtct 


tttgtgtttt 


tttaaaggat 


ttgagcgtag 


cgaaaaatcc 


ttttctttct 


1020 


tatcttgata 


ataagggtaa 


ctattgaatt 


cggtaccaag 


agtttgtaga 


aacgcaaaaa 


1080 


ggccatccgt 


caggatggcc 


ttctgcttaa 


tttgatgcct 


ggcagtttat 


ggcgggcgtc 


1140


ctgcccgcca


ccctccgggc


cgttgcttcg


caacgttcaa


atccgctccc


ggcggatttg


1200 




Clrl ClrlClCClt t~ c 
y a u uvj \^ y i~ v.- 


O y CI V^. Li d d V — - 


cj c* ^ ci y ci w u *- j ci 


acgaaaggcc 


raatrt*ffrn 

■ ci y v# w l l ^ — y 


1260 


ty ay u i_ 


tcattttatt 


taatacctoa 

i» y q i_ y \— • ^— y y 


caattcccta 


ctctcacato 


y y y <j y a v-* ^ — v — 


1320 


uoa\rf a l» v*a v. 


u y y *-»y ^* ^""y 


a cot* ttcact 


t"pt"aacrttca 

>— ' — l- v— y 


y *w a l» y y y y i— 


a nat aocrapr* 

a y y t-yyyav^v*- 


1380 




l y LL>yuL>ciyy 


vu CI CI I- LwLU I— 


t~"t~t"at~f*ac*af" 


Lyt-i, LLty^y 


ttctoattta 

u l u y a c v_ Ld 


1440


ai~r*t'ni"a'l~r" , a 


yyLLyddddL 


L L LULL-Ltd L. 


LLyLLddddL 


ay yd LLLtdL 


yyddd l y y dy 


1500


L.yyL.daciaL.L. 


y 1. LLLdLLLL 


f"*aaa.a , t~f^t"'t~,a 
LddddULL Ld 


aaanaaaarr 


UL-Lydidddy 


yyyyLL L L LL 


1560


4~ 4~ <~" 4" 3f3 3 3 3 


LigLdcggyL 


4" nn4~ 1" rnf ^ ^> 

igytLLyt 


LLLdyCdLLL 


y LLLddLLLL 


rt^ht" 1" na f pa 
y L LL LydLLd 


1620


4" 4* n S J*Y ^ 3 1— • ^ /*Y 

ttCayaaCag 


• /— • 4- +- 4- /t r-r 


4" h / n rnr*+' 

cLCatyyutL 


/~r • 4~ ^ 4~ 4* 

yccycutctL 


ydLLdydLdL 


LdtL Ltycdy 


1680


LJaad Lda Laa 


ILJdLLLLdLL 


4- p /~- 4- +" / — i / — • 4— /»»/-» 
LLLL LLLLLJL 


dCadyyoy Ly 


L,y y l. l y LdLt 


ytLLdayLal 


1740


d Lgacgccgc 


itccccgtLL 


aCCayyda Ld 


dLdLdCy L LL 


CddgdCy tg c 


LCCdLLdLLd 


1800


tld LlCdtaa 




4- /-^ -j 4" t/anna 

LtCdLLdytjd 


dyCdLLCCOd 


/~>a/^^a4-^3 a4~ 
CdgCdLCddL 


ydydLCLLLd 




LLddL Ly Ldd 


-f— /-«r 4- y« p n a 


a 4" a n4* 4~ a n n 

dLdy l uudyy 


LLLytL LL-uy 


LddLdy L l y L. 


LLLy Lyddy L 


1920


L L^uL(JL Ltd 


LLaLLyL Log 


aha^ah a 4~ t~ a 
dLdtdLdL Ld 


4--a4-4-<^4-/->4-,^</^-« 
LdLLL.LL.LLL 


dL LLL.LL.ydd 


4- a^naat" aat* 
LdLLddLddL 


1980


/~Tat~;a4~4--)4-/— +- 


a 4* 4* a a a r* /^f 
d L LdddLyLy 


' /-■ 4- 4~ 4~ 1* n a a a a 
L L L L Lydddd 


dyuddL-LyLd 


a^art^nanaa 
dL-dyuydyda 


LLdLLLLLLL 


2040


a n f a a ^ t" fpa 
d y i_cl d L L L La 


4- i*papartrtr ,, t" 

l uududyy l 


Lyay l un^yy 


a i" anna a haa 
dLdyyadLda 


dy ULLyLaL-d L 


a n i~ f~ at*nni" 
dy LLLdLyy l 


2100 


LLLyLLdy L L 


rr 4~ 4- 4" f>a a f na 


t*at*rt 1* frtnr 

L d L L L. L l_ LyL. 


ayL> L L LLdLO 




yaLULULL LL 


2160




a^aay l. Lu^y 


rapt t* nt" +■ t"n 
cqlu uy L L Ly 




LaL-ayu L. Lay 




2220 


LUt i— I- L. V— l_ V- CI 


y u v_ y l l o oy i. 


^t"^r , ^t*t*rypTT 
a i— o . c* \_, uy uy 


ay ay ^ l l i~ i- y 


nrtaanrrnl" 

y w l a a y v_»\-»y *— 




2280 


gaccgtatcg


acaggaacca


attcaatatc 


catgaagaag 


tcgctgatta


acccatcaac 


2340 


aacagctacc


tgctgcgcat


cttttaaacc


gaaataggca 


cgagtcggct 


tgactagatt


2400 


gaaaagcttc 


gtcagtacga 


tcgcgacccc 


gtcaaaatgt 


ccttctcttg 


agcgcccgca 


2460 


taacacgtct 


gtgcgtcttt 


ctacatgaat 


cgtgacattc 


ttttcaccgg 


gatacatatc 


2520 


atgagcatct 


ggcgtaaaaa 


gaatatcgac 


tccggcgttt 


tctgcaagag 


ctgcatcccg 


2580 


ctcaatatcg 


cgcggatatg 


cttcaaaatc 


ttcattaggg 


ccgaattgtg 


caggattcac 


2640 


aaaaatactc 


ataataacgg 


cgtcgttttc 


ttgtcttgct 


ttgtctgcta 


aggttaaatg 


2700 


cccctcatgc 


agaaacccca 


tcgtcggaac 


aaatccgatt 


gacttgccct 


ctgaatggta 


2760 


ttgttttatg 


gcttctttca 


gctgtgaaat 


atcagtaatc 


tgtctcatct 


tattttcccc 


2820 


cgtacaagcc 


gtcaagcact 


gtctggttca 


tttgaaagga 


atgcttttgt 


tcagggaaag 


2880


cacgatgtct


tacatcctga 


acatatccgc 


tgattgctgt 


ttcgatggtt 


tcatcaatgc


2940 


acatatatta 


ctttacaaat


ttaggtgttc


tctcaacacc 


gtggccgata 


atatcatgat


3000 


aaacgagaac 


ttgtccgtcc 


gctttcacac 


cagccccgat 


tccaatgacc 


ggtatgctta


3060 


gcgtctcggc 


aattttggct


gtgagttctg 


ccggcacaca 


ttccagcaca


agcatcatag 


3120 


rt rrf art tr 


ttcacatttt 


atactgtctt


ctattaattt


tttggcgctt


tgttcgtctt 


3180 




V- L i_a Lay tty 


L-L.tayLdt.yL. 


LydLLyaLLy 


L- y y Ly LLdua 


LLLddVj Ly dt 


3240 




ad L<y LLLLLCl 


ay L»y l La a l y 


rnrna pi t* nrr^ 

L>yL>y da l y y a 


L t v^aa a v.a v — cj 


c^rttc^tf^mc 

L«LL LL>LL>LryL> 


3300 


Lrv^L^aauuL l 


v^ay Ly^y l L»a 


ncTf^ccict t t 

LjLLLLyL L L L 


L» L LyddLya L 


dyLLyLLyLd 


c c c l LLay ty 


3360 


LdLLLLLtL L 


ctya^ay y iya 


LddydLa L d d 


J Sj La. LL 


LyLLdLddLd 


ddyy l d l Lty 


3420


yL-yLdL-L-LL. L 




tttrtt^tn^\t 

LLLLjLaLyaL 


yy a LLdLy LL 


Lytddt t y L L- 


dtdttydtdy 


3480




y^Lydyyaty 


d L La L LLLad 


y Ly dd LtdtL 


y aL-Laddd L L- 


d Ly Ltddt L t 


3540


tLyttiy ill 


a Apa ant - i" i" a 
a y L* d d y l LLa 


yLLyL.L.yyciL 


dd L L. d LddyL 


yy LtayL-dty 


dLdd L L>y y L L 


3600


l L LLoyatiu 


L L LLtJ L L L L L 




crt* t* i tat t t t 

yLLLLyLLLL 


catat tt t rt 

LdHJL L Ll-tL 


ttLttLLtLd 


3660 


yay tt<ui.y 


LLyLLyLLaa 


yaL L a L L u 1- n 


LLdLaLLL L y 


L ay a L dddy L 


LddtddLLLL 


3720 


l. t y L d a a a L L 


L L L L. a y y a a L. 


L L LaULQUdU 


y LLyLLLLLjy 


atntaciAAr'Zi 
a l y l a y a aL. a 


aatiLrOLLLLL 


3780


rT^n(~"t" ft" +■ n t~ 

L L y L. L l—. I— i— y 


y l Ly l loyyfl 


tatct ttctt 

LaLL»LLLL»LL 


yyaayu l. a y y 


L- a ^-4 vj l» V— * c c* cj a 


at t" a t"oopaa 

y l lu uyyuay 


3840 




>^ M CI CI CI CI CI CI C-l 


a ^ j a v_> v.* y \ c l _ i _ i _ 


\j Cl LI ^ * CI CI CI C4 Vh^ 


aatctttttc


aatttctttt

vj Li L L L V— ' l* l- i— 


3900 


tacactcaca 


gccacttttg 


caaaaaccgg 


acagcttcat


gccttataac


tgctgtttcg


3960 


gtcgacgatg 


atctgccgtt


ttcttctgca


agccaaaaaa


ccttccgtta


caacgagaag 


4020 


gattcttcac


tttctaaagt


tcggcgagtt


tcatccctct


gtcccagtcc 


ttttttggat 


4080 


caaggcagac 


tgctgcaatg 


tctatctatt 


ttaataatag 


gtgcagttcg


caggcgatac


4140 


tgcccaatgg 


aagtatacca 


aaatcaacgg 


gcttgtacca 


acacattagc 


ccaattcgat 


4200 


atcggcagaa 


tagatttttt 


taatgccttc 


gttcgtttct 


aaaagcagaa 


cgccttcatc 


4260 


atctatacct 


aacgccttac 


cgtaaaaggt 


tccgtttaac


gttctggctc 


tcatattagt 


4320 


gccaataccg 


agcgcatagc 


tttcccataa 


aagcttaatc 


ggcgtaaatc


cgtgcgtcat 


4380 


ataatcccgg 


taccgtttct 


caaagcatag 


taaaatatgc 


tggatgacgc


cggcccgatc 


4440 


aattttttcc 


ccagcagctt 


ggctgaggct 


tgtcgcgatg 


tccttcaatt 


catctggaaa 


4500 


atcattaggc 


tgctggttaa 


cgttaatgcc 


gatcccaatg 


atcactgaac 


gtacgcggtc 


4560 


ttcttcagcc 


tgcatttccg 


ttaggatacc 


gactgttttt 


tttccgttaa 


tcaaaatatc 


4620


atttggccat


ttaatatccg 


tttggatgcc 


tgctgcctct 


tctattccct 


gcacaacagc 


4680 


tactgcagca 


agcagagtca 


gctgcggtgt


tttttggagc


ggaatgtcag


gccgcaaaat


4740 


caggctcatc 


caaacaccgt 


ttccttcttg 


agaatgccat


accctagaca 


ttcggcccct 


4800 


tccggctgtt 


tgtttgtcag 


ccaccacaag 


ggtgccttcc 


ggtgcgttat 


tattcgcgag


4860


ctcatgagcc


gttttttgcg 


tgcttgaaag 


aacgtcatgg 


taaataagat 


gctggcccat 


4920 


cacttccgtt 


tttaatccaa 


aacgaatttc 


gctttcactg 


agttttccgg 


gttttttgat 


4980 


gagccgatat 


ccttttcttc 


taacggcttc


tacttcataa


ccctctttcc


gaagctcttc


5040 


aatatgcttc 


cacacagcag 


ttcttgaaca


gccgagagca


tcactgattt


tttggccgga 


5100 


aataaattca


ttgccggcct


gagaaaataa 


ttcaataagg 


tcttttctta 


atgttgaccg 


5160 


catgtcttca


gccactcctc


tatgtgtttc 


ttttgattgg 


agagcttccc 


tgtcacaaca 


5220




LL.^aij lvj l a u 


l— L, \— LVJ 




V^VJVJ VJ 


V- C L U Q O 


5280 




c c\~ i~ a c c c nf~ 


yd ua. uvaay a 


fcrttaacrao 


ttttaatrao 


raaattttaa 

ca y u \_ \_ u y u 


5340 


taagcgtact


aaatatcctt


cagtttcttt


tcatccaatt


tttcgttttg


ccgaagctgc


5400 


gatattttgg


ccgctgagag


cagtgctttt


ttcccagctc


tgtacattgt


cattgcgtca


5460 


aggctctggc 


caaacgtatc


ggcaatgtga


atggcttcct


tgatcacttt


tcccgggagc


5520 


ttccaggctt


tcaggaaaag


gggcgcgtct


ttcaaaacta


tgccaaggtt


aattaaaaga 


5580 


gcagcccaaa 


gctcctcacg


ggatgttaaa


gagaagaatg


gaaactcact


cgttgaaatc


5640 


aggttttctc


gtttatgata


aaaaccagga


agctcttcat


acaatctcgt 


ttgaatgagt 


5700 


atttqaaqccf 


cctaaccraaa 


agctcttccc 


tgcagcaatt 


tctcaaactc 


tatagttttt 


5760 


cgttcgactg 


aaacatggga 


gaggagtgat


ttttctttcg 


caatggcttc 


ttctgtttcc 


5820 


ggtgaaagcg


taaagccaag 


ctggctcata 


aagcgtacgg


ctctcagcat


acgaagcgca


5880 


tcctcttgaa 


atctatcctc 


aggctttcca 


acggttcgaa 


tcactttctg 


atcaatatct 


5940 


ttcttgccgc 


caaaataatc 


aagcaccttc 


ccgtccgctg 


tcatggccat


cgcattgatc


6000 


gttaaatctc 


tgcgttttag 


atcctcttct 


aatgatgaga 


taaattgcac


ttctgacggt 


6060 


cttctgaaat 


caacataatc 


agattcagtc 


cggaatgtcg


tgacttcata 


ggtttcatcc 


6120 


tcccagagca 


caataatggt 


cccgtgctct 


ttgcctacat


caacagtccg 


ctgaaacagc 


6180 


cgttctactt 


gatcaggtgc 


cgcatctgtc 


gcgatatcga


catctccgat 


cgttcgtttc 


6240 


atatagctgt 


cacgaactgc 


gcccccgaca 


aaataagcct


gatggcccgc 


ttcgattaag 


6300 


atgcggagca


cgggaagtgc 


tttgataaaa 


actttttcca 


tgtgatcact 


ccggttctgc 


6360


taaatcggca


taaatctgtt 


catactggct 


gacaattttt 


ttagaagaaa 


attcattttc 


6420 


aagcatctct 


attgccgcct 


ttgtaaaacg 


attgcttagc


tgttcatctt


ctaaaatgct


6480 


catcgcgcgg


gctgttgcgg


ccgtaacatc 


accgacatcc 


accaaaaatc 


cgctcacatt 


6540 


gttttttata 


acctcaggga 


taccgccaat 


gtttgttcca 


atacaaggca 


ctccgcaagc 


6600 


catcgcttca 


agcaggacaa 


ggccaaagct


ttctttttca 


gatagcagca 


gcttcaaatc 


6660 


gctaatagaa 


taaagatctt 


caacacggtc 


ttgatttcca


agcattaaga 


cttggtcttc 


6720 


caagccatat 


tttctgataa 


actcacaaac 


tgtcgatttc 


tccggaccgt 


ctccgactaa 


6780 


aagcagcttc


gctttcgttt


tgccagcgat


attgcggaac


acacggatga


catcctgcac


6840 


gcgtgcagcc


actggtaaca 


ggattagcag 


agcgaggtat


gtaggcggtg


ctacagagtt


6900 


cttgaagtgg 


tggcctaact


acggctacac


tagaaggaca 


gtatttggta 


tctgcgctct


6960 


gctgaagcca


gttaccttcg


gaaaaagagt


tggtagctct


tgatccggca


aacaaaccac


7020 


cgctggtagc


ggtggttttt


ttgtttgcaa


gcagcagatt


acgcgcagaa


aaaaaggatc 


7080 


tcaagaagat 


cctttgatct


tttctacggg


gtctgacgct


cagtggaacg 


aaaactcacg 


7140 


ttaagggatt 


ttggtcatga


gattatcaaa


aaggatcttc 


acctagatcc 


ttttaaatta


7200


aaaatgaagt


tttaaatcaa


tctaaagtat


atatgagtaa


acttggtctg


acagttacca


7260 


atgcttaatc


agtgaggcac 


ctatctcagc 


gatctgtcta 


tttcgttcat 


ccatagttgc


7320 


ctgactcccc 


gtcgtgtaga


taactacgat


acgggagggc


ttaccatctg 


gccccagtgc 


7380 


tgcaatgata 


ccgcgagacc 


cacgctcacc 


ggctccagat 


ttatcagcaa 


taaaccagcc 


7440 


agccggaagg


gccgagcgca


gaagtggtcc


tgcaacttta 


tccgcctcca 


tccagtctat 


7500 


taattgttgc 


cgggaagcta


gagtaagtag 


ttcgccagtt 


aatagtttgc 


gcaacgttgt 


7560 


tgccattgct


acaggcatcg 


tggtgtcacg 


ctcgtcgttt 


ggtatggctt 


cattcagctc 


7620 


cggttcccaa 


cgatcaaggc 


gagttacatg 


atcccccatg 


ttgtgcaaaa 


aagcggttag 


7680 


ctccttcggt 


cctccgatcg 


ttgtcagaag 


taagttggcc 


gcagtgttat 


cactcatggt 


7740 


tatggcagca 


ctgcataatt


ctcttactgt 


catgccatcc 


gtaagatgct


tttctgtgac 


7800 


tggtgagtac 


tcaaccaagt 


cattctgaga 


atagtgtatg 


cggcgaccga 


gttgctcttg 


7860 


cccggcgtca 


atacgggata


ataccgcgcc 


acatagcaga 


actttaaaag 


tgctcatcat 


7920 


tggaaaacgt 


tcttcggggc


gaaaactctc 


aaggatctta 


ccgctgttga 


gatccagttc 


7980 


gatgtaaccc 


actcgtgcac 


ccaactgatc 


ttcagcatct 


tttactttca 


ccagcgtttc 


8040 


tgggtgagca 


aaaacaggaa 


ggcaaaatgc 


cgcaaaaaag 


ggaataaggg 


cgacacggaa


8100


atgttgaata


ctcatactct 


tcctttttca 


atattattga 


agcatttatc 


agggttattg 


8160 


tctcatgagc 


ggatacatat 


ttgaatgtat 


ttagaaaaat 


aaacaaatag 


gggttccgcg 


8220 


cacatttccc 


cgaaaagtgc 


cacctgacgt 


ctaagaaacc 


attattatca 


tgacattaac 


8280 


ctataaaaat 


aggcgtatca 


cgaggccctt 


tcgtctcgca 


tgcggatcag 


tgagggtttg 


8340 


caactgcggg 


tcaaggatct 


ggatttcgat 


cacggcacga 


tcatcgtgcg 


ggagggcaag 


8400 


ggctccaagg


atcgggcctt


gatgttaccc


gagagcttgg


cacccagcct


gcgcgagcag


8460 


gggaattgat


ccggtggatg


accttttgaa


tgacctttaa 


tagattatat 


tactaattaa 


8520 


ttggggaccc


tagaggtccc 


cttttttatt 


ttaaaaattt 


tttcacaaaa 


cggtttacaa 


8580 


gcataacggg 


ttttgctgcc


cgcaaacggg


ctgttctggt 


gttgctagtt 


tgttatcaga 


8640 


atcccagatc 


cggcttcagg


tttgccggct 


gaaagcgcta 


tttcttccag 


aattgccatg 


8700 


attttttccc 


cacgggaggc


gtcactggct


cccgtgttgt 


cggcagcttt 


gattcgataa 


8760 


gcagcatcgc 


ctgtttcagg


ctgtctatgt


gtaactgtta


agctgtaaca


agttgtctca


8820 


ggtgttcaat 


ttcatgttct 


agttgctttg 


ttttactggt 


ttcacctgtt


ctattaggtg


8880


ttacatgctg


ttcatctgtt 


acattgtcga 


tctgttcatg 


gtgaacagct 


ttaaatgcac 


8940 


caaaaactcg 


taaaagctct 


gatgtatcta 


tcttttttac 


accgttttca


tctgtgcata 


9000 


tggacagttt 


tccctttgat 


atctaacggt 


gaacagttgt 


tctacttttg 


tttgttagtc 


9060 


ttgatgcttc 


actgatagat 


acaagagcca 


taagaacctc 


agatccttcc 


gtatttagcc 


9120 


agtatgttct 


ctagtgtggt


tcgttgtttt 


tgcgtgagcc 


atgagaacga 


accattgaga 


9180 


tcatgcttac 


tttgcatgtc 


actcaaaaat 


tttgcctcaa 


aactggtgag 


ctgaattttt 


9240 


gcagttaaag 


catcgtgtag 


tgtttttctt 


agtccgttac 


gtaggtagga 


atctgatgta 


9300 


atggttgttg 


gtattttgtc 


accattcatt 


tttatctggt 


tgttctcaag 


ttcggttacg 


9360 


agatccattt 


gtctatctag 


ttcaacttgg 


aaaatcaacg 


tatcagtcgg 


gcggcctcgc 


9420 


ttatcaacca 


ccaatttcat 


attgctgtaa 


gtgtttaaat 


ctttacttat 


tggtttcaaa 


9480 


acccattggt 


taagcctttt


aaactcatgg 


tagttatttt 


caagcattaa


catgaactta


9540


aattcatcaa 


ggctaatctc 


tatatttgcc 


ttgtgagttt 


tcttttgtgt 


tagttctttt 


9600 


aataaccact 


cataaatcct 


catagagtat 


ttgttttcaa 


aagacttaac 


atgttccaga 


9660 


ttatatttta 


tgaatttttt 


taactggaaa 


agataaggca 


atatctcttc 


actaaaaact 


9720 


aattctaatt 


tttcgcttga 


gaacttggca 


tagtttgtcc 


actggaaaat 


ctcaaagcct 


9780 


ttaaccaaag 


gattcctgat 


ttccacagtt 


ctcgtcatca 


gctctctggt 


tgctttagct 


9840


aatacaccat


aagcattttc


cctactgatg 


ttcatcatct 


gagcgtatta


gttataagtg 


9900 




trcattcttt 


crttataaaa 


ttttcaatca




tagtgccaca


9960 


mcic at" s a a a 


ttaocttoot 

\— L_ CI ^# ^ l_ U \J L. 


ttcatactcc


gttaagtcat


agcgactaat


cgctagttca 


10020 


tt trrrtrt tn,a 


uaa^ociu una 


firanarafa 


fatrtraatt 


oatctaoata 


attttaatca 


10080 




Ly ay a j y 3 


fantpaatrrA 

l» 0 y l oau L^D 


taattartaa 


tccttttcct 


ttgagttgtg 


10140 


yy LaiL t_ y i_ a 


adLt^iy^Ud 


y ci ww u u v.y u l 


yyaddauL l_ y 


t^aattctac 


tanarrrtrt 


10200 


y UaddL Lucy 


pt" ana p^t" t~ +- 


rrtrrt*nM-tti- 


1 1 1 nt 1 1 at a 

L L uy t L La La 


c LLooy l y y l. 


tataatttat 


10260 


dydolaaaya 


dayddLaaaa 


ddayd Ldada 


anaahanai* f 
dyadLd^aLL 


LLdyLLLLy i_ 


y l.o> LaaLiLfl 


10320 


CtaCtLLdyi. 


pa/tf- 4r /~» /r ;a 
LdLj L L^L.ytd 


nfaf"haraaa 
y Lai LdLdda 


dy yd l. y LL-y u 


dddLyLLy l l. 


i_ywCwwi-wuci 


10380 


caaaacagac 


/— * 4~ +* ^ r~* 

CttdaaaCCC 


LdddyyCLLd 


arrf /«f :3 

ayLagCdCLC 


LLy Lddy L L L. 


yyyudddLLy 


10440


ctgaatattc


cttttgtctc


cgaccatcag


gcacctgagt 


^ /r 4"» /~*t 4" /"* 4- 4~ 4~ 

cgctyLCLLL 


LLCytydCd t. 




tcagttcgct 


gcgctcacgg 


ctctggcagt 


gaatgggggt 


aaatggcact 


acaggcgcct 


10560 


tttatggatt 


catgcaagga 


aactacccat 


aatacaagaa 


aagcccgtca 


cgggcttctc 


10620 


agggcgtttt 


atggcgggtc 


tgctatgtgg 


tgctatctga 


cttttcgctg


ttcagcagtt 


10680 


cctgccctct 


gattttccag 


tctgaccact 


tcggattatc


ccgtgacagg 


tcattcagac 


10740 


tggctaatgc 


acccagtaag 


gcagcggtat


catcaacagg 


cttacccgtc 


ttactgtcaa 


10800 


c 












10801 



<210> 77 
<211> 8654 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN236 plasmid 

<400> 77 

ctcgaggcct acctagcttc caagaaagat atcctaacag cacaagagcg gaaagatgtt 60 

ttgttctaca tccagaacaa cctctgctaa aattcctgaa aaattttgca aaaagttgtt 120

gactttatct acaaggtgtg gtataataat cttaacaaca gcaggacgct ctagaggagg 180

agacatcatg aaaattggaa ttatcggcgg aggctccgtt ggtcttttat gcgcctatta 240

tttgtcactt tatcacgacg tgactgttgt gacgaggcgg caagaacagg ctgcggccat 300

tcagtctgaa ggaatccggc tttataaagg cggggaggaa ttcagggctg attgcagtgc 360


ggacacgagt


atcaattcgg 


actttgacct 


gcttgtcgtg 


acagtgaagc 


agcatcagct 


420 


tcaatctgtt 


ttttcgtcgc 


ttgaacgaat 


cgggaagacg 


aatatattat 


ttttgcaaaa 


480 


cggcatgggg 


catatccacg 


acctaaaaga 


ctggcacgtt 


ggccattcca 


tttatgttgg 


540 


aatcgttgag 


cacggagctg 


taagaaaatc 


ggatacagct 


gttgatcata 


caggcctagg 


600 


tgcgataaaa 


tggagcgcgt 


tcgacgatgc 


tgaaccagac 


cggctgaaca 


tcttgtttca 


660 


gcataaccat 


tcggattttc 


cgatttatta 


tgagacggat 


tggtaccgtc 


tgctgacggg 


720 


caagctgatt 


gtaaatgcgt 


gtattaatcc 


tttaactgcg 


ttattgcaag 


tgaaaaatgg 


780 


agaactgctg 


acaacgccag 


cttatctggc 


ttttatgaag 


ctggtatttc 


aggaggcatg 


840 


ccgcatttta 


aaacttgaaa 


atgaagaaaa 


ggcttgggag 


cgggttcagg 


ccgtttgtgg 


900 


gcaaacgaaa 


gagaatcgtt 


catcaatgct 


ggttgacgtc 


attggaggcc 


ggcagacgga 


960 


agctgacgcc 


attatcggat 


acttattgaa 


ggaagcaagt 


cttcaaggtc 


ttgatgccgt 


1020 


ccacctagag 


tttttatatg 


gcagcatcaa 


agcattggag 


cgaaatacaa 


acaaagtctt 


1080 


ttgagctttt 


tcggtaacat 


gctatactca 


tttcggatca 


ctaactattt 


attggaggat 


1140 


cctgttttgg 


cggatgagag 


aagattttca 


gcctgataca 


gattaaatca 


gaacgcagaa 


1200 


gcggtctgat 


aaaacagaat 


ttgcctggcg 


gcagtagcgc 


ggtggtccca 


cctgacccca 


1260 


tgccgaactc 


agaagtgaaa 


cgccgtagcg 


ccgatggtag 


tgtggggtct 


ccccatgcga 


1320 


gagtagggaa 


ctgccaggca 


tcaaataaaa 


cgaaaggctc 


agtcgaaaga 


ctgggccttt 


1380 


cgttttatct 


gttgtttgtc 


ggtgaacgct 


ctcctgagta 


ggacaaatcc 


gccgggagcg 


1440 


gatttgaacg 


ttgcgaagca 


acggcccgga 


gggtggcggg 


caggacgccc 


gccataaact 


1500 


gccaggcatc 


aaattaagca 


gaaggccatc 


ctgacggatg 


gcctttttgc 


gtttctacaa 


1560 


actcttggta 


cccagaaaaa 


gcggcaaaag 


cggctgttaa 


aaaagcgaaa 


tcgaagaagc 


1620 


tgtctgccgc 


taagacggaa 


tatcaaaagc 


gttctgctgt 


tgtgtcatct 


ttaaaagtca 


1680 


cagccgatga 


atcccagcaa 


gatgtcctaa 


aatacttgaa 


cacccagaaa 


gataaaggaa 


1740 


atgcagacca 


aattcattct 


tattatgtgg 


tgaacgggat 


tgctgttcat 


gcctcaaaag 


1800 


aggttatgga 


aaaagtggtg 


cagtttcccg 


aagtggaaaa 


ggtgcttcct 


aatgagaaac 


1860 


ggcagctttt 


taagtcatcc 


tccccattta 


atatgaaaaa 


agcacagaaa 


gctattaaag 


1920 


caactgacgg 


tgtggaatgg 


aatgtagacc 


aaatcgatgc 


cccaaaagct 


tgggcacttg 


1980 


gatatgatgg 


aactggcacg 


gttgttgcgt 


ccattgatac 


cggggtggaa 


tggaatcatc 


2040 


cggcattaaa 


agagaaatat 


cgcggatata 


atccggaaaa 


tcctaatgag 


cctgaaaatg 


2100


aaatgaactg gtatgatgcc gtagcaggcg aggcaagccc ttatgatgat ttggctcatg 2160

gaacccacgt gacaggcacg atggtgggct ctgaacctga tggaacaaat caaatcggtg 2220

tagcacctgg cgcaaaatgg attgctgtta aagcgttctc tgaagatggc ggcactgatg 2280

ctgacatttt ggaagctggt gaatgggttt tagcaccaaa ggacgcggaa ggaaatcccc 2340

acccggaaat ggctcctgat gttgtcaata actcatgggg agggggctct ggacttgatg 2400

aatggtacag agacatggtc aatgcctggc gttcggccga tattttccct gagttttcag 2460

cggggaatac ggatctcttt attcccggcg ggcctggttc tatcgcaaat ccggcaaact 2520

atccagaatc gtttgcaact ggagcgactg agaattccaa ttccccatgg agagaaaaga 2580

aaatcgctaa tgttgattac tttgaacttc tgcatattct tgaatttaaa aaggctgaaa 2640

gagtaaaaga ttgtgctgaa atattagagt ataaacaaaa tcgtgaaaca ggcgaaagaa 2700

agttgtatcg agtgtggttt tgtaaatcca ggctttgtcc aatgtgcaac tggaggagag 2760

caatgaaaca tggcattcag tcacaaaagg ttgttgctga agttattaaa caaaagccaa 2820

cagttcgttg gttgtttctc acattaacag ttaaaaatgt ttatgatggc gaagaattaa 2880

ataagagttt gtcagatatg gctcaaggat ttcgccgaat gatgcaatat aaaaaaatta 2940

ataaaaatct tgttggtttt atgcgtgcaa cggaagtgac aataaataat aaagataatt 3000

cttataatca gcacatgcat gtattggtat gtgtggaacc aacttatttt aagaatacag 3060

aaaactacgt gaatcaaaaa caatggattc aattttggaa aaaggcaatg aaattagact 3120

atgatccaaa tgtaaaagtt caaatgattc gaccgaaaaa taaatataaa tcggatatac 3180

aatcggcaat tgacgaaact gcaaaatatc ctgtaaagga tacggatttt atgaccgatg 3240

atgaagaaaa gaatttgaaa cgtttgtctg atttggagga aggtttacac cgtaaaaggt 3300

taatctccta tggtggtttg ttaaaagaaa tacataaaaa attaaacctt gatgacacag 3360

aagaaggcga tttgattcat acagatgatg acgaaaaagc cgatgaagat ggattttcta 3420

ttattgcaat gtggaattgg gaacggaaaa attattttat taaagagtag ttcaacaaac 3480

gggccatatt gttgtataag tgatgaaata ctgaatttaa aacttagttt atatgtggta 3540

aaatgtttta atcaagttta ggaggaatta attatgaagt gtaatgaatg taacagggtt 3600

caattaaaag agggaagcgt atcattaacc ctataaacta cgtctgccct cattattgga 3660

gggtgaaatg tgaatacatc ctattcacaa tcgaatttac gacacaacca aattttaatt 3720

tggctttgca ttttatcttt ttttagcgta ttaaatgaaa tggttttgaa cgtctcatta 3780

cctgatattg caaatgattt taataaacca cctgcgagta caaactgggt gaacacagcc 3840

tttatgttaa ccttttccat tggaacagct gtatatggaa agctatctga tcaattaggc 3900

atcaaaaggt tactcctatt tggaattata ataaattgtt tcgggtcggt aattgggttt 3960

gttggccatt ctttcttttc cttacttatt atggctcgtt ttattcaagg ggctggtgca 4020

gctgcatttc cagcactcgt aatggttgta gttgcgcgct atattccaaa ggaaaatagg 4080

ggtaaagcat ttggtcttat tggatcgata gtagccatgg gagaaggagt cggtccagcg 4140

attggtggaa tgatagccca ttatattcat tggtcctatc ttctactcat tcctatgata 4200

acaattatca ctgttccgtt tcttatgaaa ttattaaaga aagaagtaag gataaaaggt 4260

cattttgata tcaaaggaat tatactaatg tctgtaggca ttgtattttt tatgttgttt 4320

acaacatcat atagcatttc ttttcttatc gttagcgtgc tgtcattcct gatatttgta 4380

aaacatatca ggaaagtaac agatcctttt gttgatcccg gattagggaa aaatatacct 4440

tttatgattg gagttctttg tgggggaatt atatttggaa cagtagcagg gtttgtctct 4500

atggttcctt atatgatgaa agatgttcac cagctaagta ctgccgaaat cggaagtgta 4560

attattttcc ctggaacaat gagtgtcatt attttcggct acattggtgg gatacttgtt 4620

gatagaagag gtcctttata cgtgttaaac atcggagtta catttctttc tgttagcttt 4680

ttaactgctt cctttctttt agaaacaaca tcatggttca tgacaattat aatcgtattt 4740

gttttaggtg ggctttcgtt caccaaaaca gttatatcaa caattgtttc aagtagcttg 4800

aaacagcagg aagctggtgc tggaatgagt ttgcttaact ttaccagctt tttatcagag 4860

ggaacaggta ttgcaattgt aggtggttta ttatccatac ccttacttga tcaaaggttg 4920

ttacctatgg aagttgatca gtcaacttat ctgtatagta atttgttatt acttttttca 4980

ggaatcattg tcattagttg gctggttacc ttgaatgtat ataaacattc tcaaagggat 5040

ttctaaatcg ttaagggatc aactttggga gagagttcaa aattgatcct ttttttataa 5100

cagttcgaag cggccgcaat tcttgaagac gaaagggcct cgtgatacgc ctatttttat 5160

aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt cggggaaatg 5220

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 5280

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 5340

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 5400

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 5460

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 5520

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 5580


ggcaagagca


actcggtcgc 


cgcatacact 


attctcagaa 


tgacttggtt 


gagtactcac 


5640 


cagtcacaga 


aaagcatctt 


acggatggca 


tgacagtaag 


agaattatgc 


agtgctgcca 


5700 


taaccatgag 


tgataacact 


gcggccaact 


tacttctgac 


aacgatcgga 


ggaccgaagg 


5760 


agctaaccgc 


ttttttgcac 


aacatggggg 


atcatgtaac 


tcgccttgat 


cgttgggaac 


5820 


cggagctgaa 


tgaagccata 


ccaaacgacg 


agcgtgacac 


cacgatgcct 


gcagcaatgg 


5880 


caacaacgtt 


gcgcaaacta 


ttaactggcg 


aactacttac 


tctagcttcc 


cggcaacaat 


5940 


taatagactg 


gatggaggcg 


gataaagttg 


caggaccact 


tctgcgctcg 


gcccttccgg 


6000 


ctggctggtt 


tattgctgat 


aaatctggag 


ccggtgagcg 


tgggtctcgc 


ggtatcattg 


6060 


cagcactggg 


gccagatggt 


aagccctccc 


gtatcgtagt 


tatctacacg 


acggggagtc 


6120 


aggcaactat 


ggatgaacga 


aatagacaga 


tcgctgagat 


aggtgcctca 


ctgattaagc 


6180 


attggtaact 


gtcagaccaa 


gtttactcat 


atatacttta 


gattgattta 


aaacttcatt 


6240 


tttaatttaa 


aaggatctag 


gtgaagatcc 


tttttgataa 


tctcatgacc 


aaaatccctt 


6300 


aacgtgagtt 


ttcgttccac 


tgagcgtcag 


accccgtaga 


aaagatcaaa 


ggatcttctt 


6360 


gagatccttt 


ttttctgcgc 


gtaatctgct 


gcttgcaaac 


aaaaaaacca 


ccgctaccag 


6420 


cggtggtttg 


tttgccggat 


caagagctac 


caactctttt 


tccgaaggta 


actggcttca 


6480 


gcagagcgca 


gataccaaat 


actgtccttc 


tagtgtagcc 


gtagttaggc 


caccacttca 


6540 


agaactctgt 


agcaccgcct 


acatacctcg


ctctgctaat


cctgttacca


gtggctgctg 


6600 


ccagtggcga 


taagtcgtgt 


cttaccgggt 


tggactcaag 


acgatagtta 


ccggataagg 


6660 


cgcagcggtc 


gggctgaacg 


gggggttcgt 


gcacacagcc 


cagcttggag 


cgaacgacct 


6720 


acaccgaact 


gagataccta 


cagcgtgagc 


tatgagaaag 


cgccacgctt 


cccgaaggga 


6780 


gaaaggcgga 


caggtatccg 


gtaagcggca 


gggtcggaac 


aggagagcgc 


acgagggagc 


6840 


ttccaggggg 


aaacgcctgg 


tatctttata 


gtcctgtcgg 


gtttcgccac 


ctctgacttg 


6900 


agcgtcgatt 


tttgtgatgc 


tcgtcagggg


ggcggagcct 


atggaaaaac 


gccagcaacg 


6960 


cggccttttt 


acggttcctg 


gccttttgct 


ggccttttgc 


tcacatgttc 


tttcctgcgt 


7020 


tatcccctga 


ttctgtggat 


aaccgtatta 


ccgcctttga 


gtgagctgat 


accgctcgcc 


7080 


gcagccgaac 


gaccgagcgc 


agcgagtcag 


tgagcgagga 


agcggaagag 


cgcctgatgc 


7140 


ggtattttct 


ccttacgcat 


ctgtgcggta 


tttcacaccg 


catatggtgc 


actctcagta 


7200 


caatctgctc 


tgatgccgca 


tagttaagcc 


agtatacact 


ccgctatcgc 


tacgtgactg 


7260 


ggtcatggct 


gcgccccgac 


acccgccaac 


acccgctgac 


gcgccctgac 


gggcttgtct 


7320


gctcccggca


tccgcttaca 


gacaagctgt 


gaccgtctcc 


gggagctgca 


tgtgtcagag 


7380 


gttttcaccg 


tcatcaccga 


aacgcgcgag 


gcagctgcgg 


taaagctcat 


cagcgtggtc 


7440 


gtgaagcgat 


tcacagatgt 


ctgcctgttc 


atccgcgtcc 


agctcgttga 


gtttctccag 


7500 


aagcgttaat 


gtctggcttc 


tgataaagcg 


ggccatgtta 


agggcggttt 


tttcctgttt 


7560 


ggtcacttga 


tgcctccgtg 


taagggggaa 


tttctgttca 


tgggggtaat 


gataccgatg 


7620 


aaacgagaga 


ggatgctcac 


gatacgggtt 


actgatgatg 


aacatgcccg 


gttactggaa 


7680 


cgttgtgagg 


gtaaacaact 


ggcggtatgg 


atgcggcggg 


accagagaaa 


aatcactcag 


7740 


ggtcaatgcc 


agcgcttcgt 


taatacagat 


gtaggtgttc 


cacagggtag 


ccagcagcat 


7800 


cctgcgatgc 


agatccggaa 


cataatggtg 


cagggcgctg 


acttccgcgt 


ttccagactt 


7860 


tacgaaacac 


ggaaaccgaa 


gaccattcat 


gttgttgctc 


aggtcgcaga 


cgttttgcag 


7920 


cagcagtcgc 


ttcacgttcg 


ctcgcgtatc 


ggtgattcat 


tctgctaacc 


agtaaggcaa 


7980 


ccccgccagc 


ctagccgggt 


cctcaacgac 


aggagcacga 


tcatgcgcac 


ccgtggccag 


8040 


gacccaacgc 


tgcccgagat 


gcgccgcgtg 


cggctgctgg 


agatggcgga 


cgcgatggat 


8100 


atgttctgcc 


aagggttggt 


ttgcgcattc 


acagttctcc 


gcaagaattg 


attggctcca 


8160


attcttggag 


tggtgaatcc 


gttagcgagg 


tgccgccggc 


ttccattcag


gtcgaggtgg 


8220 


cccggctcca 


tgcaccgcga 


cgcaacgcgg 


ggaggcagac 


aaggtatagg 


gcggcgccta 


8280


caatccatgc


caacccgttc 


catgtgctcg 


ccgaggcggc 


ataaatcgcc 


gtgacgatca 


8340 


gcggtccagt 


gatcgaagtt 


aggctggtaa 


gagccgcgag 


cgatccttga 


agctgtccct 


8400 


gatggtcgtc 


atctacctgc 


ctggacagca 


tggcctgcaa 


cgcgggcatc 


ccgatgccgc 


8460 


cggaagcgag 


aagaatcata 


atggggaagg 


ccatccagcc 


tcgcgtcggc 


ggccgcttcg 


8520 


tcgaccgaaa 


cagcagttat 


aaggcatgaa 


gctgtccggt 


ttttgcaaaa 


gtggctgtga 


8580 


ctgtaaaaag 


aaatcgaaaa 


agaccgtttt 


gtgtgaaaac 


ggtctttttg 


tttcctttta 


8640 


accaactgcc 


ataa 
gggaaccgga


gctgaatgaa 


gccataccaa 


acgacgagcg


tgacaccacg 


atgcctgcag 


5460 


caatggcaac 


aacgttgcgc 


aaactattaa 


ctggcgaact 


acttactcta 


gcttcccggc 


5520 


aacaattaat 


aoactaaata 


gaggcggata


aagttgcagg 


accacttctg 


cgctcggccc 


5580 


ttccaactaq 


ctggtttatt 


gctgataaat 


ctggagccgg


tgagcgtggg


tctcgcggta


5640 


tcattgcagc


actggggcca


gatggtaagc 


cctcccgtat 


cgtagttatc 


tacacgacgg


5700 


ggagtcaggc 


aactatggat


gaacgaaata 


gacagatcgc


tgagataggt 


gcctcactga 


5760 


ttaagcattg


gtaactgtca


gaccaagttt


actcatatat


actttagatt 


gatttaaaac


5820 


ttcattttta 


atttaaaagg


atctaaatoa 


agatcctttt


tgataatctc


atgaccaaaa 


5880 


tcccttaacg


tgagttttcg


ttccactgag


cgtcagaccc


cgtagaaaag 


atcaaaggat 


5940 




tccttttttt 


ctacacataa 


tctgctgctt


gcaaacaaaa 


aaaccaccgc 


6000 


taccagcggt


ggtttgtttg 


ccggatcaag 


agctaccaac 


tctttttccg


aaggtaactg 


6060 


gcttcagcag


agcgcagata


ccaaatactg


tccttctagt


gtagccgtag


ttaggccacc


6120


acttcaagaa 


ctctgtagca


ccgcctacat 


acctcgctct 


gctaatcctg 


ttaccagtgg


6180 


ctgctgccag


tggcgataag 


tcgtgtctta 


ccqqattqqa 


ctcaagacga 


tagttaccgg


6240 


ataaggcgca


gcggtcgggc


tgaacggggg


gttcgtgcac


acagcccagc 


ttggagcgaa


6300 


cgacctacac 


cgaactgaga 


tacctacagc 


gtgagctatg


agaaagcgcc


acgcttcccg 


6360 


aagggagaaa


ggcggacagg


tatccggtaa


gcggcagggt


cggaacagga



gagcgcacga



6420 


gggagcttcc



agggggaaac


gcctggtatc


tttatagtcc 


tgtcgggttt 


cgccacctct 


6480 


gacttgagcg


tcgatttttg 


tgatgctcgt 


caggggggcg 


gagcctatgg


aaaaacgcca


6540 


gcaacgcggc 


ctttttacgg


ttcctggcct 


tttgctggcc 


ttttgctcac


atgttctttc 


6600 


ctgcgttatc 


ccctgattct 


gtggataacc 


gtattaccgc


ctttgagtga 


gctgataccg 


6660 


ctcgccgcag 


ccgaacgacc 


gagcgcagcg


agtcagtgag 


cgaggaagcg


gaagagcgcc


6720


tgatgcggta 


ttttctcctt 


acgcatctgt


gcggtatttc


acaccgcata 


tggtgcactc 


6780 


tcagtacaat 


ctgctctgat


gccgcatagt


taagccagta 


tacactccgc 


tatcgctacg


6840 


tgactgggtc 


atggctgcgc 


cccgacaccc 


gccaacaccc 


gctgacgcgc 


cctgacgggc 


6900 


ttgtctgctc 


ccggcatccg 


cttacagaca 


agctgtgacc 


gtctccggga 


gctgcatgtg 


6960 


tcagaggttt 


tcaccgtcat 


caccgaaacg 


cgcgaggcag 


ctgcggtaaa 


gctcatcagc 


7020 
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at aa t aat na 
y Lyy ut-y uya 


arrcaa t tcac 


aaaf at at fir" 


at at t ratcc 


neat craart" 


L>y t tyay t c l 


7080 




nttA^tatf^t" 

\J I— l_ CI Q L- ^ L 


yy V<L LUL^dL 


ddayv^yy y 


cii_y l_ Laayyy 


uyy lit lllu 


7140 




aotta^taar* 


LL>uy uy Ldcty 


n a ct n ^ 3 f- +* •h 
yyyydau i_ 


t at t r*p t aaa 
l. y u Lua tyy y 


not" aaf naf a 
yytdduydLd 


7200


ttya Lyadat 


yayayayyci u 


y^LL>auyciLd 


y y uidi-uy 


afnat 1 rraar*a 
a uydtyddud 


tar^r^/^rmtt a 
lytt-Lyy l id 


7260




gtgciyyy cad 


a<-,aci(_L.yyi_y 


y latyyaLyu 


y y ey y y acca 


yagaadadLC 


7320




aaugccagcg 


C L L Cg L. CddL 


dCdyatytdg 


yiy LtCCdCa 


ggg uagccag 


7380


—n ^ a 4- a a 4* 

cagcdLCccg 


cgatgcagat 


ccggaacata


atggtgcagg 


gcgctgactt


ccgcgtttcc


7440




dadCdCyyad 


occyaayacc 


dtLCdLy t ty 


t tyCLCdyy t 


cgcagacg t u 


7500


ttgcagcagc 


agtcgcttca 


cgttcgctcg


cgtatcggtg 


dLtCdltCuy 


ctaaccagta


7560


aggcaacccc 


gccagcctag 


ccgggtcctc


aacgacagga 


gcacgatcat 


gcgcacccgt 


7620


ggccaggacc 


caacgctgcc 


cgagatgcgc


cgcgtgcggc 


tgctggagat 


ggcggacgcg 


7680


atggatatgt


tctgccaagg 


gttggtttgc


gcattcacag


ttctccgcaa 


gaattgattg


7740


gctccaattc 


ttggagtggt 


gaatccgtta 


gcgaggtgcc 


gccggcttcc 


attcaggtcg 


7800 


aggtggcccg 


gctccatgca 


ccgcgacgca 


acgcggggag


gcagacaagg 


tatagggcgg


7860 


cgcctacaat 


ccatgccaac 


ccgttccatg 


tgctcgccga 


ggcggcataa 


atcgccgtga 


7920 


cgatcagcgg 


tccagtgatc 


gaagttaggc 


tggtaagagc 


cgcgagcgat


ccttgaagct 


7980 


gtccctgatg 


gtcgtcatct 


acctgcctgg 


acagcatggc 


ctgcaacgcg 


ggcatcccga 


8040 


tgccgccgga 


agcgagaaga 


atcataatgg 


ggaaggccat


ccagcctcgc 


gtc 


8093 



<210> 79 
<211> 8098 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN429 plasmid 

<400> 79 



ggcggccgct 


tcgtcgaccg 


aaacagcagt 


tataaggcat 


gaagctgtcc 


ggtttttgca 


60 


aaagtggctg 


tgactgtaaa 


aagaaatcga 


aaaagaccgt 


tttgtgtgaa 


aacggtcttt 


120 


ttgtttcctt 


ttaaccaact 


gccataactc 


gaggcctacc 


tagcttccaa 


gaaagatatc 


180 


ctaacagcac 


aagagcggaa 


agatgttttg 


ttctacatcc 


agaacaacct 


ctgctaaaat 


240 


tcctgaaaaa 


ttttgcaaaa 


agttgttgac 


tttatctaca 


aggtgtggta 


taataatctt 


300 
aacaacagca


ggacgctcta 


gattagaaag 


gaggtttaat 


taatgtatcg 


tacgatgatg 


360 


agcggcaaac 


ttcacagggc 


aactgttacg 


gaagcaaacc 


tgaactatgt 


gggaagcatt 


420 


acaattgatg 


aagatctcat 


tgatgctgtg 


ggaatgcttc 


ctaatgaaaa 


agtacaaatt 


480 


gtgaataata 


ataatggagc 


acgtcttgaa 


acgtatatta 


ttcctggtaa 


acggggaagc 


540 


ggcgtcatat 


gcttaaacgg 


tgcagccgca 


cgccttgtgc 


aggaaggaga 


taaggtcatt 


600 


attatttcct 


acaaaatgat 


gtctgatcaa 


gaagcggcaa 


gccatgagcc 


gaaagtggct 


660 


gttctgaatg 


atcaaaacaa 


aattgaacaa 


atgctgggga 


acgaaccagc 


ccgtacaatt 


720 


ttgtaaagga 


tcctgttttg 


gcggatgaga 


gaagattttc 


agcctgatac 


agattaaatc 


780 


agaacgcaga 


agcggtctga 


taaaacagaa 


tttgcctggc 


ggcagtagcg 


cggtggtccc 


840 


acctgacccc 


atgccgaact 


cagaagtgaa 


acgccgtagc 


gccgatggta 


gtgtggggtc 


900 


tccccatgcg 


agagtaggga 


actgccaggc 


atcaaataaa 


acgaaaggct 


cagtcgaaag 


960 


actgggcctt 


tcgttttatc 


tgttgtttgt 


cggtgaacgc 


tctcctgagt 


aggacaaatc 


1020 


cgccgggagc 


ggatttgaac 


gttgcgaagc 


aacggcccgg 


agggtggcgg 


gcaggacgcc 


1080 


cgccataaac 


tgccaggcat 


caaattaagc 


agaaggccat 


cctgacggat 


ggcctttttg 


1140 


cgtttctaca 


aactcttggt 


acccagaaaa 


agcggcaaaa 


gcggctgtta


aaaaagcgaa 


1200


atcgaagaag 


ctgtctgccg 


ctaagacgga 


atatcaaaag 


cgttctgctg 


ttgtgtcatc 


1260 


tttaaaagtc 


acagccgatg 


aatcccagca 


agatgtccta 


aaatacttga 


acacccagaa 


1320 


agataaagga 


aatgcagacc 


aaattcattc 


ttattatgtg 


gtgaacggga 


ttgctgttca 


1380 


tgcctcaaaa 


gaggttatgg 


aaaaagtggt 


gcagtttccc 


gaagtggaaa 


aggtgcttcc 


1440 


taatgagaaa 


cggcagcttt 


ttaagtcatc 


ctccccattt 


aatatgaaaa 


aagcacagaa 


1500 


agctattaaa 


gcaactgacg 


gtgtggaatg 


gaatgtagac 


caaatcgatg 


ccccaaaagc 


1560 


ttgggcactt 


ggatatgatg 


gaactggcac 


ggttgttgcg 


tccattgata 


ccggggtgga 


1620 


atggaatcat 


ccggcattaa 


aagagaaata 


tcgcggatat 


aatccggaaa 


atcctaatga 


1680 


gcctgaaaat 


gaaatgaact 


ggtatgatgc 


cgtagcaggc 


gaggcaagcc 


cttatgatga 


1740 


tttggctcat 


ggaacccacg 


tgacaggcac 


gatggtgggc 


tctgaacctg 


atggaacaaa 


1800 


tcaaatcggt 


gtagcacctg 


gcgcaaaatg 


gattgctgtt 


aaagcgttct 


ctgaagatgg 


1860 


cggcactgat 


gctgacattt 


tggaagctgg 


tgaatgggtt 


ttagcaccaa 


aggacgcgga 


1920 


aggaaatccc 


cacccggaaa 


tggctcctga 


tgttgtcaat 


aactcatggg 


gagggggctc 


1980 


tggacttgat 


gaatggtaca 


gagacatggt 


caatgcctgg 


cgttcggccg 


atattttccc 


2040 
tgagttttca


gcggggaata 


tccggcaaac 


tatccagaat 


gagagaaaag 


aaaatcgcta 


aaaggctgaa 


agagtaaaag 


aggcgaaaga 


aagttgtatc 


ctggaggaga 


gcaatgaaac 


acaaaagcca 


acagttcgtt 


cgaagaatta 


aataagagtt 


taaaaaaatt 


aataaaaatc 


taaagataat 


tcttataatc 


taagaataca 


gaaaactacg 


gaaattagac 


tatgatccaa 


atcggatata 


caatcggcaa


tatgaccgat 


gatgaagaaa 


ccgtaaaagg 


ttaatctcct 


tgatgacaca 


gaagaaggcg 


tggattttct 


attattgcaa 


gttcaacaaa 


cgggccatat 


tatatgtggt 


aaaatgtttt 


gtaacagggt 


tcaattaaaa 


tcattattgg 


agggtgaaat 


aaattttaat 


ttggctttgc 


acgtctcatt 


acctgatatt 


tgaacacagc 


ctttatgtta 


atcaattagg 


catcaaaagg 


taattgggtt 


tgttggccat 


gggctggtgc 


agctgcattt 


aggaaaatag 


gggtaaagca 


tcggtccagc 


gattggtgga 
cggatctctt tattcccggc
cgtttgcaac tggagcgact 
atgttgatta ctttgaactt 
attgtgctga aatattagag 
gagtgtggtt ttgtaaatcc 
atggcattca gtcacaaaag 
ggttgtttct cacattaaca 
tgtcagatat ggctcaagga 
ttgttggttt tatgcgtgca 
agcacatgca tgtattggta 
tgaatcaaaa acaatggatt 
atgtaaaagt tcaaatgatt 
ttgacgaaac tgcaaaatat 
agaatttgaa acgtttgtct 
atggtggttt gttaaaagaa 
atttgattca tacagatgat 
tgtggaattg ggaacggaaa 
tgttgtataa gtgatgaaat 
aatcaagttt aggaggaatt 
gagggaagcg tatcattaac 
gtgaatacat cctattcaca 
attttatctt tttttagcgt 
gcaaatgatt ttaataaacc 
accttttcca ttggaacagc 
ttactcctat ttggaattat 
tctttctttt ccttacttat 
ccagcactcg taatggttgt 
tttggtctta ttggatcgat 
atgatagccc attatattca 
gggcctggtt


ctatcgcaaa 


2100 


gagaattcca 


attccccatg 


2160 


ctgcatattc 


ttgaatttaa 


2220 


tataaacaaa 


atcgtgaaac 


2280 


aggctttgtc 


caatgtgcaa 


2340 


gttgttgctg 


aagttattaa 


2400 


gttaaaaatg 


tttatgatgg 


2460 


tttcgccgaa 


tgatgcaata 


2520 


acggaagtga 


caataaataa 


2580 


tgtgtggaac 


caacttattt 


2640 


caattttgga 


aaaaggcaat 


2700 


cgaccgaaaa 


ataaatataa 


2760 


cctgtaaagg 


atacggattt 


2820 


gatttggagg 


aaggtttaca


2880 


atacataaaa 


aattaaacct 


2940 


gacgaaaaag 


ccgatgaaga 


3000 


aattatttta 


ttaaagagta


3060 


actgaattta 


aaacttagtt 


3120 


aattatgaag 


tgtaatgaat 


3180 


cctataaact 


acgtctgccc 


3240 


atcgaattta 


cgacacaacc 


3300 


attaaatgaa 


atggttttga 


3360 


acctgcgagt 


acaaactggg 


3420 


tgtatatgga 


aagctatctg 


3480 


aataaattgt 


ttcgggtcgg 


3540 


tatggctcgt 


tttattcaag 


3600 


agttgcgcgc 


tatattccaa 


3660 


agtagccatg 


ggagaaggag 


3720 


ttggtcctat 


cttctactca 


3780 
ttcctatgat


aacaattatc 


actgttccgt 


ttcttatgaa 


attattaaag 


aaagaagtaa 


3840 


ggataaaagg 


tcattttgat 


atcaaaggaa 


ttatactaat 


gtctgtaggc 


attgtatttt 


3900 


ttatgttgtt 


tacaacatca 


tatagcattt 


cttttcttat 


cgttagcgtg 


ctgtcattcc 


3960 


tgatatttgt 


aaaacatatc 


aggaaagtaa 


cagatccttt 


tgttgatccc 


ggattaggga 


4020 


aaaatatacc 


ttttatgatt 


ggagttcttt 


gtgggggaat 


tatatttgga 


acagtagcag 


4080 


ggtttgtctc 


tatggttcct 


tatatgatga 


aagatgttca 


ccagctaagt 


actgccgaaa 


4140 


tcggaagtgt 


aattattttc 


cctggaacaa 


tgagtgtcat 


tattttcggc 


tacattggtg 


4200 


ggatacttgt 


tgatagaaga 


ggtcctttat 


acgtgttaaa 


catcggagtt 


acatttcttt 


4260 


ctgttagctt 


tttaactgct 


tcctttcttt 


tagaaacaac 


atcatggttc 


atgacaatta 


4320 


taatcgtatt 


tgttttaggt 


gggctttcgt 


tcaccaaaac 


agttatatca 


acaattgttt 


4380 


caagtagctt 


gaaacagcag 


gaagctggtg 


ctggaatgag 


tttgcttaac 


tttaccagct 


4440 


ttttatcaga 


gggaacaggt 


attgcaattg 


taggtggttt 


attatccata 


cccttacttg 


4500 


atcaaaggtt 


gttacctatg 


gaagttgatc 


agtcaactta 


tctgtatagt 


aatttgttat 


4560 


tacttttttc 


aggaatcatt 


gtcattagtt 


ggctggttac 


cttgaatgta 


tataaacatt 


4620 


ctcaaaggga 


tttctaaatc 


gttaagggat 


caactttggg 


agagagttca 


aaattgatcc 


4680 


tttttttata 


acagttcgaa 


gcggccgcaa 


ttcttgaaga 


cgaaagggcc 


tcgtgatacg 


4740 


cctattttta 


taggttaatg 


tcatgataat 


aatggtttct 


tagacgtcag 


gtggcacttt 


4800 


tcggggaaat 


gtgcgcggaa 


cccctatttg 


tttatttttc 


taaatacatt 


caaatatgta 


4860 


tccgctcatg 


agacaataac 


cctgataaat 


gcttcaataa 


tattgaaaaa 


ggaagagtat 


4920 


gagtattcaa 


catttccgtg 


tcgcccttat 


tccctttttt 


gcggcatttt 


gccttcctgt 


4980 


ttttgctcac 


ccagaaacgc 


tggtgaaagt 


aaaagatgct 


gaagatcagt 


tgggtgcacg 


5040 


agtgggttac 


atcgaactgg 


atctcaacag 


cggtaagatc 


cttgagagtt 


ttcgccccga 


5100 


agaacgtttt 


ccaatgatga 


gcacttttaa 


agttctgcta 


tgtggcgcgg 


tattatcccg 


5160 


tattgacgcc 


gggcaagagc 


aactcggtcg 


ccgcatacac 


tattctcaga 


atgacttggt 


5220 


tgagtactca 


ccagtcacag 


aaaagcatct 


tacggatggc 


atgacagtaa 


gagaattatg 


5280 


cagtgctgcc 


ataaccatga 


gtgataacac 


tgcggccaac 


ttacttctga 


caacgatcgg 


5340 


aggaccgaag 


gagctaaccg 


cttttttgca 


caacatgggg 


gatcatgtaa 


ctcgccttga 


5400 


tcgttgggaa 


ccggagctga 


atgaagccat 


accaaacgac 


gagcgtgaca 


ccacgatgcc 


5460 


tgcagcaatg 


gcaacaacgt 


tgcgcaaact 


attaactggc 


gaactactta 


ctctagcttc 


5520 
ccggcaacaa


ttaatagact 


ggatggaggc 


ggataaagtt 


gcaggaccac 


ttctgcgctc 


5580 


ggcccttccg 


gctggctggt 


ttattgctga 


taaatctgga 


gccggtgagc 


gtgggtctcg 


5640 


cggtatcatt 


gcagcactgg 


ggccagatgg 


taagccctcc 


cgtatcgtag 


ttatctacac 


5700 


gacggggagt 


caggcaacta 


tggatgaacg 


aaatagacag 


atcgctgaga 


taggtgcctc 


5760 


actgattaag 


cattggtaac 


tgtcagacca 


agtttactca 


tatatacttt 


agattgattt 


5820 


aaaacttcat 


ttttaattta 


aaaggatcta 


ggtgaagatc 


ctttttgata 


atctcatgac 


5880 


caaaatccct 


taacgtgagt 


tttcgttcca 


ctgagcgtca 


gaccccgtag 


aaaagatcaa 


5940 


aggatcttct 


tgagatcctt 


tttttctgcg 


cgtaatctgc 


tgcttgcaaa 


caaaaaaacc 


6000 


accgctacca 


gcggtggttt 


gtttgccgga 


tcaagagcta 


ccaactcttt 


ttccgaaggt 


6060 


aactggcttc 


agcagagcgc 


agataccaaa 


tactgtcctt 


ctagtgtagc 


cgtagttagg 


6120 


ccaccacttc 


aagaactctg 


tagcaccgcc 


tacatacctc 


gctctgctaa 


tcctgttacc 


6180 


agtggctgct 


gccagtggcg 


ataagtcgtg 


tcttaccggg 


ttggactcaa 


gacgatagtt 


6240 


accggataag 


gcgcagcggt 


cgggctgaac 


ggggggttcg 


tgcacacagc 


ccagcttgga 


6300 


gcgaacgacc 


tacaccgaac 


tgagatacct 


acagcgtgag 


ctatgagaaa 


gcgccacgct 


6360 


tcccgaaggg 


agaaaggcgg 


acaggtatcc 


ggtaagcggc 


agggtcggaa 


caggagagcg 


6420 


cacgagggag 


cttccagggg 


gaaacgcctg 


gtatctttat 


agtcctgtcg 


ggtttcgcca 


6480 


cctctgactt 


gagcgtcgat 


ttttgtgatg 


ctcgtcaggg 


gggcggagcc 


tatggaaaaa 


6540 


cgccagcaac 


gcggcctttt 


tacggttcct 


ggccttttgc


tggccttttg 


ctcacatgtt 


6600 


ctttcctgcg 


ttatcccctg 


attctgtgga 


taaccgtatt 


accgcctttg 


agtgagctga 


6660 


taccgctcgc 


cgcagccgaa 


cgaccgagcg 


cagcgagtca 


gtgagcgagg 


aagcggaaga 


6720 


gcgcctgatg 


cggtattttc 


tccttacgca 


tctgtgcggt 


atttcacacc 


gcatatggtg 


6780 


cactctcagt 


acaatctgct 


ctgatgccgc 


atagttaagc 


cagtatacac 


tccgctatcg 


6840 


ctacgtgact 


gggtcatggc 


tgcgccccga 


cacccgccaa 


cacccgctga 


cgcgccctga 


6900 


cgggcttgtc 


tgctcccggc 


atccgcttac 


agacaagctg 


tgaccgtctc 


cgggagctgc 


6960 


atgtgtcaga 


ggttttcacc 


gtcatcaccg 


aaacgcgcga 


ggcagctgcg 


gtaaagctca 


7020 


tcagcgtggt 


cgtgaagcga 


ttcacagatg 


tctgcctgtt 


catccgcgtc 


cagctcgttg 


7080 


agtttctcca 


gaagcgttaa 


tgtctggctt 


ctgataaagc 


gggccatgtt 


aagggcggtt 


7140 


ttttcctgtt 


tggtcacttg 


atgcctccgt 


gtaaggggga 


atttctgttc 


atgggggtaa 


7200 


tgataccgat 


gaaacgagag 


aggatgctca 


cgatacgggt 


tactgatgat 


gaacatgccc 


7260 
ggttactgga


acgttgtgag 


ggtaaacaac 


tggcggtatg 


gatgcggcgg 


gaccagagaa 


7320 


aaatcactca 


gggtcaatgc 


cagcgcttcg 


ttaatacaga 


tgtaggtgtt 


ccacagggta 


7380 


gccagcagca 


tcctgcgatg 


cagatccgga 


acataatggt 


gcagggcgct 


gacttccgcg 


7440 


tttccagact 


ttacgaaaca 


cggaaaccga 


agaccattca 


tgttgttgct 


caggtcgcag 


7500 


acgttttgca 


gcagcagtcg 


cttcacgttc 


gctcgcgtat 


cggtgattca 


ttctgctaac 


7560 


cagtaaggca 


accccgccag 


cctagccggg 


tcctcaacga 


caggagcacg 


atcatgcgca 


7620 


cccgtggcca 


ggacccaacg 


ctgcccgaga 


tgcgccgcgt 


gcggctgctg 


gagatggcgg 


7680 


acgcgatgga 


tatgttctgc 


caagggttgg 


tttgcgcatt 


cacagttctc 


cgcaagaatt 


7740 


gattggctcc 


aattcttgga 


gtggtgaatc 


cgttagcgag 


gtgccgccgg 


cttccattca 


7800 


ggtcgaggtg 


gcccggctcc 


atgcaccgcg 


acgcaacgcg 


gggaggcaga 


caaggtatag 


7860 


ggcggcgcct 


acaatccatg 


ccaacccgtt 


ccatgtgctc 


gccgaggcgg 


cataaatcgc 


7920 


cgtgacgatc 


agcggtccag 


tgatcgaagt 


taggctggta 


agagccgcga 


gcgatccttg 


7980 


aagctgtccc 


tgatggtcgt 


catctacctg 


cctggacagc 


atggcctgca 


acgcgggcat 


8040 


cccgatgccg 


ccggaagcga 


gaagaatcat 


aatggggaag 


gccatccagc 


ctcgcgtc 


8098 



<210> 80 
<211> 4450 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN443 plasmid

<400> 80 



ccatcgaatg 


gccagatgat 


taattcctaa 


tttttgttga 


cactctatca 


ttgatagagt 


60 


tattttacca 


ctccctatca 


gtgatagaga 


aaagtgaaat 


gaatagttcg 


acaaaaatct 


120 


agattagaaa


ggaggattta 


aatatgagac 


agattactga 


tatttcacag 


ctgaaagaag 


180 


ccataaaaca 


ataccattca 


gagggcaagt 


caatcggatt 


tgttccgacg


atggggtttc 


240 


tgcatgaggg 


gcatttaacc 


ttagcagaca 


aagcaagaca 


agaaaacgac 


gccgttatta 


300 


tgagtatttt 


tgtgaatcct 


gcacaattcg gccctaatga 


agattttgaa 


gcatatccgc 


360 


gcgatattga 


gcgggatgca 


gctcttgcag aaaacgccgg 


agtcgatatt 


ctttttacgc


420 


cagatgctca 


tgatatgtat 


cccggtgaaa 


agaatgtcac 


gattcatgta 


gaaagacgca 


480 


cagacgtgtt 


atgcgggcgc 


tcaagagaag 


gacattttga 


cggggtcgcg 


atcgtactga 


540 


cgaagctttt 


caatctagtc 


aagccgactc 


gtgcctattt 


cggtttaaaa 


gatgcgcagc 


600 
aggtagctgt


tgttgatggg


ttaatcagcg 


acttcttcat 


ggatattgaa 


ttggttcctg 


660 


tcgatacggt 


cagagaggaa


gacggcttag


ccaaaagctc 


tcgcaatgta 


tacttaacag 


720 


ctgaggaaag 


aaaagaagcg 


cctaagctgt 


atcgggccct


tcaaacaagt 


gcggaacttg


780 


tccaagccgg 


tgaaagagat 


cctgaagcgg 


tgataaaagc


tgcaaaagat 


atcattgaaa 


840 


V* W V4 ^ \M 


aaccatagac


tatgtagagc 


tttattccta 


tccggaactc


gagcctgtga 


900 


ataaaattaf* 


tggaaagatg 


attctcgctg 


ttgcagttgc


tttttcaaaa


gcgcgtttaa


960 


1" acrat Aa1~at* 


rat t"at"t" era t 


a +- f pat anaa 

Q ^- i- V-' c, a y a c* 


a aaa no t a a a 


t taatatota 


tcatacaata 


1020 


t3 c,^ciyv^yyv^. a 


aacttracaa 


V_4 VJ l — » CI CI W 1^ L, 


a v - — vj w ci ci *— » ci ci 


acrtaaacta 


tataacraaac 

C \U C ^ ^ CI C4 Cj 


1080 


A t" t" A PA A t" 1" n 




rafffiaf firf 

v>a l. a \j l 


y *— y y y a a ty v 


1" t" rrf a a t era 

c, c V— w c. ci ci luu 


aaaaataraa 

ci ci ci ci cj uav^aa 


1140 


A"t~t"ntTraat"a 

a l Lvj ty aa La 


a Laa Laa L^y 


ci y oa^y l^l 


y a a a v^»y c> ci i_c* 


c ca C- i,L<L*i.yy 


■^aaarTirrrrria 

c ci ci ci \-j vy vy ^.j ci 


1200 


a y L* y y y l. a. 


i_ a L. y L. L. LOaa 


Lyy LyLayLL 


orarrrrrt t" rt 

u oauvy l c y 


L.y\w>ciyyciciyy 


ay a uaoy y l.v^- 


1260 


a t La L La l.L L 


v_- L a L a a a a L. 


y a l. y l l. a l 


^aciyciciyi_.yy 


l>o a y L~\~ci ty o 


y iw-t-y ci a a y i- y 


1320 


y y u lu Ly o 






oaaa i_yv^cyy 




anrrratara 


1380 


□ ft" t~ t~ n i~ a ;3 


rffr^f" cccccct 
yjua i_ v^ \— y 


^^^T3"^"^ , ^ , p■t"^ , 

y y y a i_ 


yctyy *_v^yciv_<«» 


I" cic^ nncinn a 

i-yc, ciyyyyya 


v ^ a Lyy l - ^ . ■ ~~ 


1440 


d vi ' v.^ L l_ U \y CI 


c4 c^ c-. ca v , v_> v^c* 


at* *"caaaaaa 

\-4 C, — y CI d UOCJ 


1zaa taagct t 


gacctgtgaa 


gtgaaaa a t g 


1500 




' — V-4 V^- ~A Ci U L L L 


tttttatcta 


cent ttacra 


ctactacatc 


a egg a t ct cc 


1560 


a v_~ vi - vj v_- c. w 


c a \-A \—\u \j ci 


1" 1" a ri nrrinrrn 

i_ i_ a ci v_> w vj 


v-y y y «~y uy y u 


ggttacgcgc


agcgtgaccg 


1620 


ctacacttgc 


cagcgcccta


gcgcccgctc 


ctttcgcttt


cttcccttcc


tttctcgcca


1680 


cqtt caccqq 


ctttccccgt 


caagctctaa 


atcgggggct


ccctttaggg


ttccgattta 


1740 


gtgctttacg 


gcacctcgac 


cccaaaaaac 


ttgattaggg


tgatggttca 


cgtagtgggc


1800 


catcgccctg 


atagacggtt 


tttcgccctt


tgacgttgga


gtccacgttc 


tttaatagtg 


1860 


gactcttgtt 


ccaaactgga 


acaacactca 


accctatctc 


ggtctattct 


tttgatttat 


1920 


aagggatttt 


gccgatttcg 


gcctattggt 


taaaaaatga 


gctgatttaa 


caaaaattta 


1980 


acgcgaattt 


taacaaaata 


ttaacgctta 


caatttcagg 


tggcactttt 


cggggaaatg


2040 


tgcgcggaac 


ccctatttgt 


ttatttttct 


aaatacattc 


aaatatgtat 


ccgctcatga 


2100 


gacaataacc 


ctgataaatg 


cttcaataat 


attgaaaaag 


gaagagtatg 


agtattcaac 


2160 


atttccgtgt 


cgcccttatt 


cccttttttg 


cggcattttg


ccttcctgtt 


tttgctcacc 


2220 


cagaaacgct 


ggtgaaagta 


aaagatgctg 


aagatcagtt 


gggtgcacga 


gtgggttaca 


2280 


tcgaactgga 


tctcaacagc 


ggtaagatcc 


ttgagagttt 


tcgccccgaa 


gaacgttttc 


2340 
caatgatgag


cacttttaaa


gttctgctat


Qtaacacaat 


attatcccgt


attaacocco 


2400 


\_4 V4 V* Ci CI U d w ^ O 


actcggtcgc


caratacart 

V- \-4 d \m* C4 CI Vh- 


u L L. v.-, L- W C4 V4 d O 


taarttaat t 


y ^ ci \^ ^ kv> 0 


2460


raotraraoa 


aaaacatctt 

*-* GL \-yi \A U •— 1 — I— w 


acoaataoca 


taaraataaa 


aaaattatoc 


aatoctarra 


2520 




Ly a Lddvd^> L 


y\_»yyi_i u >cici\-> l 


■|~ao't*+*<* , '+■^T3^ , 

L Ct ^ L LLLUdL 


a a L-y a Luyy d 


y y dLLy day y 






L L L L L uy UdU 


a a L-d Lyyyyy 


a llo uy Lddu 


t" pnrrl" I"n3 1* 
LLyn, l, uyd l. 


L>y l LyyyddL. 


2640


L.yydyL.Lydd 


LyddyLOdLd 


p*/"* .a .a a p*pt a p - pt 
LL-ddaLy dL.y 


ctyoy uydudu 


Lduyd LyuL l. 


nt" anpa a t - nrt 
y LdyLdd Lyy 


2700


L,daL-aaL-yLL- 


yCyCdadLCd 


t* t* a a +» n n c fi 
LtddOLyycy 


ddCtaCLLdC 


♦"O+rapiot" +*oo 
LLLdyLL LLV, 


LyyuddLddL 


2760


♦"/"tat" a /■» a /*> t- /r 
L y a LdydLLy 


g a t_ g g a yy C y 


/"Ya*t~aaa/"Tt"'+"'<"T 

yd Ldddy l ty 


parfrtapAaAf 
L-ayyciL.(_cl(_-L 


LLtyuyuLoy 


r~* /-* 4~ i" /try 
y L/L.L. L LLLy y 




y' - * 4- /~x f*w 4- 4— 


LaLLgCLydt 


dddLLLyydy 


uc_gyLgcigt-y 


LgyuLLLogL 


yyidLLdLLy 


2880


CayCaCiyyu 


S-t *— . -25 r~Y m +" /~Y r"Y 4~ 

gCCagaLyyt 


adyCCCLCCC 


g ta ucg Lag u 


LdLCtdCdCy 


/w ^1 4~ * 

aeggggagee 




ayycadCLdL 


ggangaacga 


aatagacaga 


uegcugagat 


aggugcccca 


4- /-> 4— 4~ 

CLyaLLdagC 




attggtagga


attaatgatg 


tctcgtttag


ataaaagtaa 


agtgattaac 


agcgcattag


3060


-i ripf" /*r 4- 4- n Ti 

dyCr.gcttaa 


xgagg tegga 


a Lcgaagg l l 


LddCddCCCy 


LdddCLCgCC 


cagaagcLag 


3120


/— » 4— /~» 4- ^ /-w ^ /-*t ~\ 

y cytayayCa 


gcccacauiig 


LdLLyyCdLg 


LadaaddLda 


/"» /-^t ^-f 4— 4— 4— 

geggget l eg 


/~» 4~ /^t ii fx r~* 4~ 

CLcgacgccx. 


3180


LayCLd L Ly a 


ydLyLLdydL 


dyyudCL-d L a 


r"h r*a r , t*+' ^ f"rr 
LLLdLLLLCy 


/—•/-■/-< 4- 4- 4^ a rra a 
L-L. L. L L Lay dd 


yyyydddy l l 


3240


yytadydLL L 


fff a/^nhaal" 
LLCduy LddL 


s a t~*r\f> faaaa 
ddLy L Ldddd 


y L L L Ldyd Ly 


LyLLLLdL-Ld 


afii* pat* f*ftrT*t 

dy Ltd Luy uy 




d uyydyoddd 


~3 4- -ji ^» 4~ 4~" "f~ a 

dy LdLdLLLd 


yyt-diL-cicyyc^ 


^LclL>ciyctclclc3 


dOciy Let Ly da 


Z^C > \~ pt" P*PTja J3 3 

dL LL. LL<y ddd 




a L L- a ci L LdyL 


LLLLLLdLljL 


/-> araanni" f- 
v^ctclv^ctciyy U L 


l. l, LudLLdyd 


yddLy^dLLd 


L d Ly LaL LLa 


3420 


nrnrfl nt" rrnn 


nr > at - t*i"i"3r1' 

y l <— i— lol L 


1" f^nnf hnrn 

(_ Layij i_ i_ y \w# y 




i— a a vJ a v_j ci 


V-^ Ci Ci y l» ^ ^ Ci 


3480 




a a a a a cachet 






atfattacaa 


raaactatca 


3540 


aattatttga 


tcaccaaggt


gcagagccag


ccttcttatt 


cggccttgaa


ttgatcatat


3600 


gcggattaga 


aaaacaactt 


aaatgtgaaa 


gtgggtctta


aaagcagcat 


aacctttttc


3660 


cgtgatggta 


acttcactag


tttaaaagga


tctaggtgaa 


gatccttttt


gataatctca


3720 


tgaccaaaat 


cccttaacgt 


gagttttcgt 


tccactgagc 


gtcagacccc 


gtagaaaaga 


3780 


tcaaaggatc 


ttcttgagat 


cctttttttc 


tgcgcgtaat 


ctgctgcttg 


caaacaaaaa 


3840 


aaccaccgct 


accagcggtg


gtttgtttgc 


cggatcaaga


gctaccaact 


ctttttccga 


3900 


aggtaactgg 


cttcagcaga 


gcgcagatac


caaatactgt 


ccttctagtg 


tagccgtagt


3960 


taggccacca 


cttcaagaac 


tctgtagcac 


cgcctacata 


cctcgctctg 


ctaatcctgt 


4020 


taccagtggc 


tgctgccagt 


ggcgataagt 


cgtgtcttac 


cgggttggac 


tcaagacgat 


4080 






agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct 4140 

tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca 4200 

cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 4260 

agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc 4320 

gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga 4380

aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca 4440
tgacccgaca 4450



<210> 81 
<211> 10212 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN251 plasmid 



<400> 81 
gcggccgcta 


aaaagagctt 


gaggatttgc 


ggagtgaaaa 


tcagacattg 


cggaatcagc 


60 


tagagatgac 


agaagaggat 


tacaaggcac 


tgatcgatat 


catggatcgg 


gccagaaaaa 


120 


tggttgtttc 


gaaggaagac 


ggaagaatga 


aaaaagcggc 


tcaagaaacg 


taaagaaacg 


180 


cctgaaatga 


accggcccta 


tagtaagaat 


aggccggttg 


ttttgatttc 


tatgcagact 


240 


ctcccggtgt 


catttcgcga 


tccatatcag 


gatgccagat 


gagcgggtct 


tcccctttgt 


300 


cccgcgccat 


atcatactta 


acagttttaa 


agttcatttt 


gttccaaaat 


tccgctgatt 


360 


tcattctcgg 


atttgtccgg 


atcggcattt 


tgaatgattt 


tgcaaattca 


accaaggctc 


420 


tcccgtatcc 


cctgttctgg 


tagcccggaa 


gaacctcaag 


cttccacagc 


tccaaataat 


480 


cctggcggtt 


gtcaaaatag 


ggattcgatt 


tgccgttaac 


ttgatacaga 


ctcattcgtg 


540 


ctacaagttt 


atcgccaaaa 


taaatcccgt 


aaaaaggcga 


ggtgctgtca 


ttttcaataa 


600 


tattatcctg 


aagttcttca 


agcattgaaa 


gctcctgaat 


gccgtattct 


ttgaatttct 


660 


tgaattcttc 


cagcgtttta 


tagttgataa 


gcagacgttc 


tacctttgtc 


aaacaaatct 


720 


ccccctttgt 


tgtttctaca 


tatattgtaa 


acgctttatt 


taaaaaatcc 


aaatatttaa 


780 


actttaattt 


taagcacatg 


ggatctttga 


gaagtaattt 


cttcttactt 


ctgctatgat 


840 


aatacgtaaa 


tgcgtcgacc 


gaaacagcag 


ttataaggca 


tgaagctgtc 


cggtttttgc 


900 


aaaagtggct 


gtgactgtaa 


aaagaaatcg 


aaaaagaccg 


ttttgtgtga 


aaacggtctt 


960 
tttgtttcct


tttaaccaac 


tgccataact 


cgaggcctac 


ctagcttcca 


agaaagatat 


1020 


cctaacagca 


caagagcgga 


aagatgtttt 


gttctacatc 


cagaacaacc 


tctgctaaaa 


1080 


ttcctgaaaa 


attttgcaaa 


aagttgttga 


ctttatctac 


aaggtgtggt 


ataataatct 


1140 


taacaacagc 


aggacgctct 


agaggaggag 


acatcatgaa 


aattggaatt 


atcggcggag 


1200 


gctccgttgg 


tcttttatgc 


gcctattatt 


tgtcacttta 


tcacgacgtg 


actgttgtga 


1260 


cgaggcggca 


agaacaggct 


gcggccattc 


agtctgaagg 


aatccggctt 


tataaaggcg 


1320 


gggaggaatt 


cagggctgat 


tgcagtgcgg 


acacgagtat 


caattcggac 


tttgacctgc 


1380 


ttgtcgtgac 


agtgaagcag 


catcagcttc 


aatctgtttt 


ttcgtcgctt 


gaacgaatcg 


1440 


ggaagacgaa 


tatattattt 


ttgcaaaacg 


gcatggggca 


tatccacgac 


ctaaaagact 


1500 


ggcacgttgg 


ccattccatt 


tatgttggaa 


tcgttgagca 


cggagctgta 


agaaaatcgg 


1560 


atacagctgt 


tgatcataca 


ggcctaggtg 


cgataaaatg 


gagcgcgttc 


gacgatgctg 


1620 


aaccagaccg 


gctgaacatc 


ttgtttcagc 


ataaccattc


ggattttccg 


atttattatg 


1680 


agacggattg 


gtaccgtctg 


ctgacgggca 


agctgattgt 


aaatgcgtgt 


attaatcctt 


1740 


taactgcgtt 


attgcaagtg


aaaaatggag 


aactgctgac 


aacgccagct 


tatctggctt


1800 


ttatgaagct


ggtatttcag 


gaggcatgcc 


gcattttaaa 


acttgaaaat 


gaagaaaagg 


1860 


cttgggagcg 


ggttcaggcc 


gtttgtgggc 


aaacgaaaga 


gaatcgttca 


tcaatgctgg 


1920 


ttgacgtcat 


tggaggccgg 


cagacggaag 


ctgacgccat 


tatcggatac 


ttattgaagg 


1980 


aagcaagtct 


tcaaggtctt 


gatgccgtcc 


acctagagtt 


tttatatggc 


agcatcaaag 


2040 


cattggagcg 


aaatacaaac 


aaagtctttt 


gagctttttc 


ggtaacatgc 


tatactcatt 


2100 


tcggatcact 


aactatttat 


tggagaaagg 


aagttctaga 


agatgcagct 


aactgaactt 


2160 


tccatcaaaa 


atcagaatgt 


gtttgtacag 


cactatatag 


atggcaaaga 


agaaatgtct 


2220 


tctttttttg 


attacagtat 


tcatcataag 


gacatgtggc 


gcgaaagact 


ggaagactta 


2280 


tcttcccggt 


ttttcgcaag 


agaggaattg 


gcggcgtact 


taacctctta 


ccataataaa 


2340 


ttcggttcaa 


gtgcgatgca 


gtctgctatt 


gagaagctga 


aggacccgtc 


aagtgccgct 


2400 


gtagtcggcg 


gacagcaggc 


aggactttta 


acaggaccgc 


tttacaccat 


acataaaatc 


2460 


atttcaatca 


ttgttttagc 


aaagcaacaa 


gaaaaggaac 


tgcaagtgcc 


tgtcatacca 


2520 


atcttctggg 


tggctggaga 


agaccacgat 


ttggatgaga 


ttaattttgt 


tcacacatct 


2580 


gaagagaatg 


ggcctgtgaa 


aaaaaagctg 


cctcagtctt 


attggaagaa 


atcatcagca 


2640 


gcgagtacat 


cgcttgatca 


ggaaaagtgt 


gccgcgtgga 


tagatgatgt 


ttttgccgct 


2700 
tttgaagaaa


cagaccatac 


gaatacactt 


ctcgacaatg 


tgaaacgatg 


tttaagggaa 


2760 


tctgttacgt 


ttactgactt 


ctttgaactg 


ctgatcgcgg 


atttgttcca 


agaagagggc 


2820 


ttagttttat 


taaattctgg 


ggatcctgtt 


ttggcggatg 


agagaagatt 


ttcagcctga 


2880 


tacagattaa 


atcagaacgc 


agaagcggtc 


tgataaaaca 


gaatttgcct 


ggcggcagta 


2940 


gcgcggtggt 


cccacctgac 


cccatgccga 


actcagaagt 


gaaacgccgt 


agcgccgatg 


3000 


gtagtgtggg 


gtctccccat 


gcgagagtag 


ggaactgcca 


ggcatcaaat 


aaaacgaaag 


3060 


gctcagtcga 


aagactgggc 


ctttcgtttt 


atctgttgtt 


tgtcggtgaa 


cgctctcctg 


3120 


agtaggacaa 


atccgccggg 


agcggatttg 


aacgttgcga 


agcaacggcc 


cggagggtgg 


3180 


cgggcaggac 


gcccgccata 


aactgccagg 


catcaaatta 


agcagaaggc 


catcctgacg 


3240 


gatggccttt 


ttgcgtttct 


acaaactctt 


ggtacccaga 


aaaagcggca 


aaagcggctg 


3300 


ttaaaaaagc 


gaaatcgaag 


aagctgtctg 


ccgctaagac 


ggaatatcaa 


aagcgttctg 


3360 


ctgttgtgtc 


atctttaaaa 


gtcacagccg 


atgaatccca 


gcaagatgtc 


ctaaaatact 


3420 


tgaacaccca 


gaaagataaa 


ggaaatgcag 


accaaattca 


ttcttattat 


gtggtgaacg 


3480 


ggattgctgt 


tcatgcctca 


aaagaggtta


tggaaaaagt 


ggtgcagttt 


cccgaagtgg


3540 


aaaaggtgct 


tcctaatgag 


aaacggcagc 


tttttaagtc 


atcctcccca 


tttaatatga 


3600 


aaaaagcaca 


gaaagctatt 


aaagcaactg 


acggtgtgga 


atggaatgta 


gaccaaatcg 


3660 


atgccccaaa 


agcttgggca 


cttggatatg 


atggaactgg 


cacggttgtt 


gcgtccattg 


3720 


ataccggggt 


ggaatggaat 


catccggcat 


taaaagagaa 


atatcgcgga 


tataatccgg 


3780 


aaaatcctaa 


tgagcctgaa 


aatgaaatga 


actggtatga 


tgccgtagca 


ggcgaggcaa 


3840 


gcccttatga 


tgatttggct 


catggaaccc 


acgtgacagg 


cacgatggtg 


ggctctgaac 


3900 


ctgatggaac 


aaatcaaatc 


ggtgtagcac 


ctggcgcaaa 


atggattgct 


gttaaagcgt 


3960 


tctctgaaga 


tggcggcact 


gatgctgaca 


ttttggaagc 


tggtgaatgg 


gttttagcac 


4020 


caaaggacgc 


ggaaggaaat 


ccccacccgg 


aaatggctcc 


tgatgttgtc 


aataactcat 


4080 


ggggaggggg 


ctctggactt 


gatgaatggt 


acagagacat 


ggtcaatgcc 


tggcgttcgg 


4140 


ccgatatttt 


ccctgagttt 


tcagcgggga 


atacggatct 


ctttattccc 


ggcgggcctg 


4200 


gttctatcgc 


aaatccggca 


aactatccag 


aatcgtttgc 


aactggagcg 


actgagaatt 


4260 


ccaattcccc 


atggagagaa 


aagaaaatcg 


ctaatgttga 


ttactttgaa 


cttctgcata 


4320 


ttcttgaatt 


taaaaaggct 


gaaagagtaa 


aagattgtgc 


tgaaatatta 


gagtataaac 


4380 


aaaatcgtga 


aacaggcgaa 


agaaagttgt 


atcgagtgtg 


gttttgtaaa 


tccaggcttt 


4440 
gtccaatgtg


caactggagg 


ctgaagttat 


taaacaaaag 


atgtttatga 


tggcgaagaa 


gaatgatgca 


atataaaaaa 


tgacaataaa 


taataaagat 


aaccaactta 


ttttaagaat 


ggaaaaaggc 


aatgaaatta 


aaaataaata 


taaatcggat 


aggatacgga 


ttttatgacc 


aggaaggttt 


acaccgtaaa 


aaaaattaaa 


ccttgatgac 


aagccgatga 


agatggattt 


ttattaaaga 


gtagttcaac 


ttaaaactta 


gtttatatgt 


aagtgtaatg 


aatgtaacag 


actacgtctg 


ccctcattat 


ttacgacaca 


accaaatttt 


gaaatggttt 


tgaacgtctc 


agtacaaact 


gggtgaacac 


ggaaagctat 


ctgatcaatt 


tgtttcgggt 


cggtaattgg 


cgttttattc 


aaggggctgg 


cgctatattc 


caaaggaaaa 


atgggagaag 


gagtcggtcc 


tatcttctac 


tcattcctat 


aagaaagaag 


taaggataaa 


ggcattgtat 


tttttatgtt 


gtgctgtcat 


tcctgatatt 


cccggattag 


ggaaaaatat 
agagcaatga aacatggcat
ccaacagttc gttggttgtt 
ttaaataaga gtttgtcaga 
attaataaaa atcttgttgg 
aattcttata atcagcacat 
acagaaaact acgtgaatca 
gactatgatc caaatgtaaa 
atacaatcgg caattgacga 
gatgatgaag aaaagaattt 
aggttaatct cctatggtgg 
acagaagaag gcgatttgat 
tctattattg caatgtggaa 
aaacgggcca tattgttgta 
ggtaaaatgt tttaatcaag 
ggttcaatta aaagagggaa 
tggagggtga aatgtgaata 
aatttggctt tgcattttat 
attacctgat attgcaaatg 
agcctttatg ttaacctttt 
aggcatcaaa aggttactcc 
gtttgttggc cattctttct 
tgcagctgca tttccagcac 
taggggtaaa gcatttggtc 
agcgattggt ggaatgatag 
gataacaatt atcactgttc 
aggtcatttt gatatcaaag 
gtttacaaca tcatatagca 
tgtaaaacat atcaggaaag 
accttttatg attggagttc
tcagtcacaa


aaggttgttg 


4500 


tctcacatta 


acagttaaaa 


4560 


tatggctcaa 


ggatttcgcc 


4620


ttttatgcgt 


gcaacggaag 


4680 


gcatgtattg 


gtatgtgtgg 


4740 


aaaacaatgg 


attcaatttt 


4800 


agttcaaatg 


attcgaccga 


4860 


aactgcaaaa 


tatcctgtaa 


4920 


gaaacgtttg 


tctgatttgg 


4980 


tttgttaaaa 


gaaatacata 


5040 


tcatacagat 


gatgacgaaa 


5100 


ttgggaacgg 


aaaaattatt 


5160 


taagtgatga 


aatactgaat 


5220 


tttaggagga 


attaattatg 


5280 


gcgtatcatt 


caccctataa


5340 


catcctattc 


acaatcgaat 


5400 


ctttttttag 


cgtattaaat 


5460


attttaataa 


accacctgcg 


5520 


ccattggaac 


agctgtatat 


5580 


tatttggaat 


tataataaat 


5640 


tttccttact 


tattatggct 


5700 


tcgtaatggt 


tgtagttgcg 


5760 


ttattggatc 


gatagtagcc 


5820 


cccattatat 


tcattggtcc 


5880 


cgtttcttat 


gaaattatta 


5940 


gaattatact 


aatgtctgta 


6000 


tttcttttct 


tatcgttagc 


6060 


taacagatcc 


ttttgttgat 


6120 


tttgtggggg 


aattatattt 


6180 
ggaacagtag


cagggtttgt 


agtactgccg 


aaatcggaag 


ggctacattg 


gtgggatact 


gttacatttc 


tttctgttag 


ttcatgacaa 


ttataatcgt 


tcaacaattg 


tttcaagtag 


aactttacca 


gctttttatc 


atacccttac 


ttgatcaaag 


agtaatttgt 


tattactttt 


gtatataaac 


attctcaaag 


tcaaaattga 


tccttttttt 


gcctcgtgat 


acgcctattt 


caggtggcac 


ttttcgggga 


attcaaatat 


gtatccgctc 


aaaggaagag 


tatgagtatt 


tttgccttcc


tgtttttgct 


agttgggtgc 


acgagtgggt 


gttttcgccc 


cgaagaacgt 


cggtattatc 


ccgtattgac 


agaatgactt 


ggttgagtac 


taagagaatt 


atgcagtgct 


tgacaacgat 


cggaggaccg 


taactcgcct 


tgatcgttgg 


acaccacgat 


gcctgcagca 


ttactctagc 


ttcccggcaa 


cacttctgcg 


ctcggccctt 


agcgtgggtc 


tcgcggtatc 


tagttatcta 


cacgacgggg 


agataggtgc 


ctcactgatt 




ctctatggtt ccttatatga 
tgtaattatt ttccctggaa 
tgttgataga agaggtcctt 
ctttttaact gcttcctttc 
atttgtttta ggtgggcttt 
cttgaaacag caggaagctg 
agagggaaca ggtattgcaa 
gttgttacct atggaagttg 
ttcaggaatc attgtcatta 
ggatttctaa atcgttaagg 
ataacagttc gaagcggccg 
ttataggtta atgtcatgat 
aatgtgcgcg gaacccctat 
atgagacaat aaccctgata 
caacatttcc gtgtcgccct 
cacccagaaa cgctggtgaa 
tacatcgaac tggatctcaa 
tttccaatga tgagcacttt 
gccgggcaag agcaactcgg 
tcaccagtca cagaaaagca 
gccataacca tgagtgataa 
aaggagctaa ccgctttttt 
gaaccggagc tgaatgaagc 
atggcaacaa cgttgcgcaa 
caattaatag actggatgga 
ccggctggct ggtttattgc 
attgcagcac tggggccaga 
agtcaggcaa ctatggatga 
aagcattggt aactgtcaga 
tgaaagatgt tcaccagcta 6240
caatgagtgt cattattttc 6300 
tatacgtgtt aaacatcgga 6360 
ttttagaaac aacatcatgg 6420 
cgttcaccaa aacagttata 6480 
gtgctggaat gagtttgctt 6540 
ttgtaggtgg tttattatcc 6600 
atcagtcaac ttatctgtat 6660 
gttggctggt taccttgaat 6720 
gatcaacttt gggagagagt 6780 
caattcttga agacgaaagg 6840 
aataatggtt tcttagacgt 6900 
ttgtttattt ttctaaatac 6960 
aatgcttcaa taatattgaa 7020 
tattcccttt tttgcggcat 7080 
agtaaaagat gctgaagatc 7140 
cagcggtaag atccttgaga 7200 
taaagttctg ctatgtggcg 7260 
tcgccgcata cactattctc 7320 
tcttacggat ggcatgacag 7380 
cactgcggcc aacttacttc 7440 
gcacaacatg ggggatcatg 7500 
cataccaaac gacgagcgtg 7560 
actattaact ggcgaactac 7620 
ggcggataaa gttgcaggac 7680 
tgataaatct ggagccggtg 7740 
tggtaagccc tcccgtatcg 7800 
acgaaataga cagatcgctg 7860 
ccaagtttac tcatatatac 7920 
tttagattga


tttaaaactt 


catttttaat 


ttaaaaggat 


ctaggtgaag 


atcctttttg 


7980 


ataatctcat 


gaccaaaatc 


ccttaacgtg 


agttttcgtt 


ccactgagcg 


tcagaccccg 


8040 


tagaaaagat 


caaaggatct 


tcttgagatc 


ctttttttct 


gcgcgtaatc 


tgctgcttgc 


8100 


aaacaaaaaa 


accaccgcta 


ccagcggtgg 


tttgtttgcc 


ggatcaagag 


ctaccaactc 


8160 


tttttccgaa 


ggtaactggc 


ttcagcagag 


cgcagatacc 


aaatactgtc 


cttctagtgt 


8220 


agccgtagtt 


aggccaccac 


ttcaagaact 


ctgtagcacc 


gcctacatac 


ctcgctctgc 


8280 


taatcctgtt 


accagtggct 


gctgccagtg 


gcgataagtc 


gtgtcttacc 


gggttggact 


8340 


caagacgata 


gttaccggat 


aaggcgcagc 


ggtcgggctg 


aacggggggt 


tcgtgcacac 


8400 


agcccagctt 


ggagcgaacg 


acctacaccg 


aactgagata 


cctacagcgt 


gagctatgag 


8460 


aaagcgccac 


gcttcccgaa 


gggagaaagg 


cggacaggta 


tccggtaagc 


ggcagggtcg 


8520 


gaacaggaga 


gcgcacgagg 


gagcttccag 


ggggaaacgc 


ctggtatctt 


tatagtcctg 


8580 


tcgggtttcg 


ccacctctga 


cttgagcgtc 


gatttttgtg 


atgctcgtca 


ggggggcgga 


8640 


gcctatggaa 


aaacgccagc 


aacgcggcct 


ttttacggtt 


cctggccttt 


tgctggcctt 


8700 


ttgctcacat 


gttctttcct 


gcgttatccc 


ctgattctgt


ggataaccgt 


attaccgcct 


8760


ttgagtgagc 


tgataccgct 


cgccgcagcc 


gaacgaccga 


gcgcagcgag 


tcagtgagcg 


8820 


aggaagcgga 


agagcgcctg 


atgcggtatt 


ttctccttac 


gcatctgtgc 


ggtatttcac 


8880 


accgcatatg 


gtgcactctc 


agtacaatct 


gctctgatgc 


cgcatagtta 


agccagtata 


8940 


cactccgcta 


tcgctacgtg 


actgggtcat 


ggctgcgccc 


cgacacccgc 


caacacccgc 


9000 


tgacgcgccc 


tgacgggctt 


gtctgctccc 


ggcatccgct 


tacagacaag 


ctgtgaccgt 


9060 


ctccgggagc 


tgcatgtgtc 


agaggttttc 


accgtcatca 


ccgaaacgcg 


cgaggcagct 


9120 


gcggtaaagc 


tcatcagcgt 


ggtcgtgaag 


cgattcacag 


atgtctgcct 


gttcatccgc 


9180 


gtccagctcg 


ttgagtttct 


ccagaagcgt 


taatgtctgg 


cttctgataa 


agcgggccat 


9240 


gttaagggcg 


gttttttcct 


gtttggtcac 


ttgatgcctc 


cgtgtaaggg 


ggaatttctg 


9300 


ttcatggggg 


taatgatacc 


gatgaaacga 


gagaggatgc 


tcacgatacg 


ggttactgat 


9360 


gatgaacatg 


cccggttact 


ggaacgttgt 


gagggtaaac 


aactggcggt 


atggatgcgg 


9420 


cgggaccaga 


gaaaaatcac 


tcagggtcaa 


tgccagcgct 


tcgttaatac 


agatgtaggt 


9480 


gttccacagg 


gtagccagca 


gcatcctgcg 


atgcagatcc 


ggaacataat 


ggtgcagggc 


9540 


gctgacttcc 


gcgtttccag 


actttacgaa 


acacggaaac 


cgaagaccat 


tcatgttgtt 


9600 


gctcaggtcg 


cagacgtttt 


gcagcagcag 


tcgcttcacg 


ttcgctcgcg 


tatcggtgat 


9660 
tcattctgct


aaccagtaag 


gcaaccccgc 


cagcctagcc 


gggtcctcaa 


cgacaggagc 


9720 


acgatcatgc 


gcacccgtgg 


ccaggaccca 


acgctgcccg 


agatgcgccg


cgtgcggctg 


9780 


ctggagatgg 


cggacgcgat 


ggatatgttc 


tgccaagggt 


tggtttgcgc 


attcacagtt 


9840 


ctccgcaaga 


attgattggc 


tccaattctt 


ggagtggtga 


atccgttagc


gaggtgccgc


9900 


cggcttccat 


tcaggtcgag 


gtggcccggc 


tccatgcacc 


gcgacgcaac 


gcggggaggc


9960 


agacaaggta 


tagggcggcg 


cctacaatcc 


atgccaaccc 


gttccatgtg 


ctcgccgagg 


10020 


cggcataaat 


cgccgtgacg 


atcagcggtc


cagtgatcga 


agttaggctg 


gtaagagccg 


10080 


cgagcgatcc 


ttgaagctgt 


ccctgatggt 


cgtcatctac 


ctgcctggac 


agcatggcct


10140 


gcaacgcggg 


catcccgatg 


ccgccggaag 


cgagaagaat 


cataatgggg 


aaggccatcc 


10200 


agcctcgcgt 


cg










10212 



<210> 82 

<211> 10426 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN267 plasmid 



<400> 82 



aacaaaattc 


tccagtcttc 


acatcggttt 


gaaaggagga 


agcggaagaa


tgaagtaaga 


60 


gggatttttg 


actccgaagt 


aagtcttcaa 


aaaatcaaat 


aaggagtgtc 


aagaatgttt


120


gcaaaacgat 


tcaaaacctc 


tttactgccg 


ttattcgctg 


gatttttatt 


gctgtttcat 


180 


ttggttctgg 


caggaccggc 


ggctgcgagt 


gctgaaacgg


cgaacaaatc 


gaatgagctt 


240 


acagcaccgt 


cgatcaaaag 


cggaaccatt


cttcatgcat


ggaattggtc 


gttcaatacg 


300 


ttaaaacaca 


atatgaagga 


tattcatgat 


gcaggatata 


cagccattca 


gacatctccg 


360 


attaaccaag 


taaaggaagg 


gaatcaagga 


gataaaagca 


tgtcgaactg 


gtactggctg 


420 


tatcagccga


catcgtatca


aattggcaac 


cgttacttag 


gtactgaaca 


agaatttaaa 


480 


gaaatgtgtg 


cagccgctga 


agaatatggc 


ataaaggtca 


ttgttgacgc 


ggtcatcaat 


540 


cataccacca 


gtgattatgc 


cgcgatttcc 


aatgaggtta 


agagtattcc 


aaactggaca 


600 


catggaaaca 


cacaaattaa 


aaactggtct 


gatcgaaata 


gtacataatg 


gatttcctta 


660 


cgcgaaatac


gggcagacat 


ggcctgcccg 


gttattatta 


tttttgacac 


cagaccaact 


720 


ggtaatggta 


gcgaccggcg 


ctcaggatcg 


tctcggtacc 


aagagtttgt 


agaaacgcaa


780 


aaaggccatc


cgtcaggatg 


gccttctgct 


taatttgatg 


cctggcagtt 


tatggcgggc


840 
gtcctgcccg


ccaccctccg 


ggccgttgct 


tcgcaacgtt 


caaatccgct 


cccggcggat 


900 


ttgtcctact 


caggagagcg 


ttcaccgaca 


aacaacagat 


aaaacgaaag 


gcccagtctt 


960 


tcgactgagc 


ctttcgtttt 


atttgatgcc 


tggcagttcc 


ctactctcgc 


atggggagac 


1020 


cccacactac 


catcggcgct 


acggcgtttc 


acttctgagt 


tcggcatggg 


gtcaggtggg 



1080 


accaccgcgc 


tactgccgcc 


aggcaaattc 


tgttttatca 


gaccgcttct 


gcgttctgat 


1140 


ttaatctgta 


tcaggctgaa 


aatcttctct 


catccgccaa 


aacaggatcc 


atcacgaagc 


1200 


gtcgtatcga 


aaaaattaat 


tttgcgcaac 


ggagaccacc 


gcttccttct 


tcttgccttg 


1260 


tttcacaaac 


ggcatcattt 


cacgaagctc


tcttcccact 


acttcgattt 


gatgttcgtt 


1320 


ctcgcttgca 


ttgatagcgt 


taaaacaaag


acggtttact 


tggttttcga


cgatccactc 


1380 


ttttgcgaat 


gtaccgtttt 


ggatatcttt 


taatacttct


ttcatagatt


cttttacttt 


1440 


agcgtccaca 


acacaaaaoc 


ctgatacgaa


atctccccac


tgtgctgtgt


cagagattga 


1500 


atatctcatt 


cctgcaagtc 


cttcttcgta


cataaggtct


acgattaatt


tcagctcatg


1560 


aagacactcg 


aagtatgcaa 


gttcaggctg


ataacctgct 


tcagttaagg


tttcaaatcc


1620 


ggctttgaca


agcgcgctta


atccgccgca 


aagaactgct 


tgctcaccga 


acaaatctgt 


1680 


ttctgtttct 


tctttaaatg 


tcgtttctaa 


tacgcccgct 


cttgcgccgc 


cgattccttt 


1740 


agcataagcg 


agggctttgt


ctcttgcttc 


tccagtcaca


tcttgataga 


ttgcgaacaa 


1800 


tgcaggtacg


ccagctcctt


gctcatatgt 


tcttcttacc 


aagtgtcccg 


ggcctttagg


1860


ggccactaag 


aatacatcta 


catccgccgg 


aggaacaatt 


tgatggaaat 


gcacgttaaa 


1920 


tccatgagcg 


aatactaatg 


attttcctgc 


tgtcaattca 


tctttgattt 


cagcttcgta 


1980 


tactttttgc 


tgctgctcat 


ccggaagcag 


aaccatgatg 


atttcggctt 


gggctgccgc 


2040 


ttcttttact 


gaaaatactt 


tatgtccgtc 


ttcttgggct 


tgagtgaaag 


attttccttg 


2100 


tctaacaccg 


acgatcacgt 


ctactccgct 


ttctttaagg 


ttcagggcat 


gtgcgtggcc 


2160 


ttgcgaaccg 


tacccgataa 


ccgctactgt 


ttttccagcc 


aatacgttct 


ctttgatatc 


2220 


accgttataa 


tatactttta 


ccatttcaat


ctctcccttg 



ttatgtctta


tacaatagat


2280


attgttttat 


tggatgacgc 


cttttgctgg 


ttcccctcgc 


aaaagccgtt 


gtacctgttc 


2340 


tcgcgatttc 


tttaatgcca 


taaggtttta 


ataactcaat 


aagcgcttca 


attttgttag 


2400 


attcacctgt 


cacctgaaca 


acgatgctgt 


ctctgctgac 


atcaacgaca 


gaggctctaa 


2460 


acggttctat 


gattccatta 


atctctgttc 


ttgttgaagg 


tgcggagaca 


accttgatta 


2520 


aggccagctc 


cctctggaca 


atcgattgat 


ttgtgatgtc 


tgtgactttc 


agcacatcaa 


2580 
tctgtttgtt


gagctgtttc 


gttaactgtt 


caacatcatt 


ttcaccttca 


acatgaacga 


2640 


cgaaggtgat 


tctggaaacg 


ccggctgttt 


ctgtgtgtcc 


aactgtaatg 


ctttcaatgt 


2700 


tgtaatgcct 


ttttgtgaat 


agaccggtga 


tccggtttaa 


caccccggag 


cggttcacca 


2760 


cagtcaatgt 


gataattctt 


ttcaaggttt 


cacccccacc 


atttcatgca


gccctttccc 


2820 


cggagccacc 


atcgggaata


ctttttcttc 


gctggcaacc 


cgcacgtcaa 


tgacaacagg 


2880 


ttctcttgat 


gttaatgcct


cttccagctt 


ttcctttgct


tccgcttctg 


atgaaattct


2940 


gatgccttta 


atgccgtatg 


ctreggacaa 


tttgacgaag 


tcaggctgag


aagcgaattt 


3000 


agattctgaa


taacgttctt 


catagaaaat


ttcctgccac


tgtctgacca


ttccgagaca 


3060 


agcgttattt 


aaaatcacta 


ccttgaccgg 


aagatttaat 


tcgcgaataa


catcgagttc


3120 


ttgaagcgtc


atttggaatc


cgccgtctcc


gacaaccgcg 


acaacagtag 


catctttttc


3180 


ggccagctgt


gcgccgatcg


ccgccggaag 


accgaatccc


atcgttccaa 


gtccgcctga


3240 


cgtgacccat 


ttatctgctt


tttggaacgg


ataaaattgc


gctgaccaca


tttgatgctg


3300 


gcctacatcc


gttgcgacaa 


tggcctctcc


ttttgtaaat


tgatgaatat


attcaatcaa


3360 


tttctgaggt 


ttaaaacctt


cttcttcatt 


atctacatac 


cagagcggat


actcttcttt 


3420 


ccattctgcg 


agctgttttt 


tccattcgct 


tgaatcgctt


tgtttgccgt 


cttgtttgat 


3480 


cagctcctgc


aggacaattt 


tgctgtctcc 


gactacagga 


atctgtgttt 


tcatgatttt 


3540


tccgatttca 


gctggatcaa


tatcgatgtg 


ggctatcttt 


gcgtttctgg


caaagtgttt 


3600 


caggtttcct 


gtgacacggt


catcaaaacg 


ggcgccgata


ctgattaata 


gatcacattc 


3660 


atgaagggcc


atattggctg 


tataagtacc 


gtgcattccc 


gccatcccta 


ggaaaagcgg


3720 


atggtcagcc 


gggaagcctc


cgagccccaa 


aagggtgtgt 


gccacaggga


tttgctgctg 


3780 


ttcagcataa 


ttttttaatt 


cttctgacgc 


ttttccgtgc 


agtacgcccg 


cacccgccag 


3840 


gatcaccggt


tttttcgcac


tgctcacggc 


ttccacaagc 


ttgcggatct


gcaaataatt 


3900 


cggctctgtt 


gtcggctggt 


atcccgggag


attcatctca 


tgatcgtagc


tgaattctcc 


3960 


ttcaattgtt 


gctacatctt 


tcggaatatc


aatcaataca 


ggtccgggtc 


ttccagttgt 


4020 


tgcaatatgg 


aacgcttctt


taatgatgcg 


cggcagatct


tccggctggc 


gaacctggta 


4080 


gctgtgtttt 


gttactggca 


tcgtaatccc 


taaaatgtct 


gcttcctgaa 


atgcatcgct


4140 


cccgattaca 


gaggttgcta


cctgccctgt 


aaagacgact 


aacggcaatg 


aatcaatcat 


4200 


ggcatcagca 


aggcctgtaa 


caaggtttgt 


cgctcccggc 


ctgacgtggc 


aatgacgaca 


4260 


ccggtttccg 


gagacccttg 


cgtatccctc 


cgctgcatga 


attgctcctt


gttcgtgacg 


4320 
ggaaggatat


gtaccaacct 


gaatgtatag 


cttatcgtaa


atcggaagca


cagccccgcc 


4380 


cggataaccg 


aagatcattt


ctactttctc 


tttttttaat 


gattcaatca 


gcattaatcg 


4440 


tccgctcatc 


gtctgtgtac 


attcggcaga 


tgctgaatcc


acctgtacat 


tagtccccat 


4500 


tttatctcct 


cctctagagc 


gtcctgctgt 


tgttaagatt 


attataccac


accttgtaga 


4560 


taaagtcaac


aactttttgc


aaaatttttc


aggaatttta


ocaaaaatta 



t tctoaatot 


4620 


aaaacaaaac 

a \J C4 d UQUUU ^ — - 


atctttccac 

Li 1— X— • l_ L. V_ V — . \_ V_j V_* 


tcttotart'o 

v. u (,y l l- V- y 


t taoaat 1 ate 


tt tetfc aaaa 

L L L L» L. L. y y Ci Ci 


art aont aoa 

y v— l o y y l cj y lj 


4680


l. l. u^yoy no 


l y y v_.cty l l y y 


L LuuaQUiJaa 


dLaaciciaLjaL 


L- y L L L LL.clV_.Cl 


L,ClClC-ClL-yy LL, 


4740 


t"t*t t* f rash t 


j-pf h t tt ara 

L v— L L L L Laud 


y l LaLay Lua 


L, L L ty Laaa 


ddv t_ y y a l* a y 


LL L V_CJ LLJLLL 


4800 


t ataact art 

l d w a ci *_> l v — u 


Li L L L Vw U u L V \u 


ov^yoviyv^yy^ 


V_- Li wLU L L. L. CI CL 


CA L— \-J CT CI L L L> L* L 


nraort nnrn 

iJLuUL'LUyLM 


4860 


aauyy ^-*y ct L L 


l l v^y l wLy i.ij 


ciCiL.av_.oi_y i_ l 


a Lad LddL. Ld 


L Ct Ct L> L Ct CI L Ct Ct 


v_. y LddLyj LLjd 


4920




at" at" t" t" M" aa 
dLaL L L L Laa 


aaLaa Lijaa L 


a y y l l l dLdL 


L LaLL L LaLj L 


L LLdLLJLJddd 


4980


Lyauaua L L. ci 


tafecatatat 

Latua LCI La L 


aaLLLO'-juaL 


awaa L LOaLL 


OtjCla L Oa L L CI 


1 1 a trt aaat* 

L LCI LL LdLja L 


5040 


aaaaaaf t"t"a 


rraarrr % r , a a 1~ n 




d Lcia.civ_.i.cicica 


tf aarrf t tat" 

LLClCiyLLLCJL 


L LQu L LQOL3 


5100 


a ct Ataaa ta 


taaaa1*annt* 


arfaatcaaa 


a Lau y y y 


nna t a t a 1 1 f 

La La L l.L 


naa t acntac 


5160 


^-4 O Ci l_. Ci C* CI ^ L. d 


ataaacifcciaa 

a t a o n vj i. V) a n 


QGOUU LQL L 


' — yvjaanv.a _. i_ 


Luoaoaa Lua 


ccf" 1" p\ Y t* rr n t~ 

llllu i- *- y y i— 


5220 


acttacatgt


ttggatcagg


acrt t" era aa ot 


ooactaaaac 



caaatagtga


tcttaacttt 


5280


1 1 aotcatca 



tatctaaacc 



a 1 1 oacaaat 



caaagtaaag


aaatacttat 



acaaaaaatt



5340


agacctattt


caaaaaaaat


aggagataaa 


agcaacttac


gatatattga


attaacaatt


5400 


attattcagc


aagaaatggt 


accgtggaat 


catcctccca 


aacaagaatt


tatttatgga 


5460 


gaatggttac 


aagagcttta 


tgaacaagga 


tacattcctc 


agaaggaatt 


aaattcagat 


5520 


ttaaccataa 


tgctttacca 


agcaaaacga 


aaaaataaaa 


gaatatacgg


aaattatgac 


5580 


ttagaggaat 


tactacctga 


tattccattt 


tctgatgtga 


gaagagccat


tatggattcg 


5640 


tcagaggaat 


taatagataa 


ttatcaggat 


gatgaaacca 


actctatatt 


aactttatgc 


5700 


cgtatgattt


taactatgga


cacgggtaaa


atcataccaa


aagatattgc


gggaaatgca


5760


gtggctgaat 


cttctccatt 


agaacatagg 


gagagaattt 


tgttagcagt 


tcgtagttat


5820 


cttggagaga 


atattgaatg 


gactaatgaa 


aatgtaaatt 


taactataaa 


ctatttaaat 


5880 


aacagattaa 


aaaaattata 


aaaaaattga 


aaaaatggtg 


gaaacacttt 


tttcaatttt 


5940 


tttgttttat 


tatttaatat 


ttgggaaata 


ttcattctaa 


ttggtaatca 


gattttagaa 


6000 


aacaataaac 


ccttgcatag 


ggggatcgat 


atccgtttag


gctgggcggt 


gatagcttct 


6060 
cgttcaggca


gtacgcctct 


tttcttttcc 


agacctgagg


gaggcggaaa


tggtgtgagg


6120 


ttcccgggga


aaagccaaat 


aggcaatcgc


gggagtgctt


tatttgaaga 


tcaggctatc 


6180 


actgcggtca 


atagatttca 


caatgtgatg 


gctggacagc


ctgaggaact 


ctcgaacccg 


6240 


aatggaaaca 


accagatatt 


tatgaatcag 


cgcggctcac 


atggcgttgt


gctggcaaat 


6300 


nr~ , acrnttc**at 

y V-» d y uuuo u 


rctctatctc 


tatcaataca 

c> c* uv^uu v> v* >»y 


acaacaaaat 


tacctaat a a 


caggtatgac


6360 


aataaaacta 




atttcaaata 


C* C4 v.* y C* V— \j ^ ^ 


aactigacagg 


cacgatcaat 


6420 


fir"r"ant*itr~trr 


i_ d y v.-. i_ y Ly l. l 


t 1" at* rrt" na t 


na tat hnraa 

ya l a u ty waa 


aaararrtra 

d d y v— y v— iw.v_.ci 


tattttcctt 


6480 


aaoaattaca 

y uy na u t a 


uao^ayy Ly c- 


aacacattct 


ttcaateratc 


aactaacaat 

t* y** \£ a 


taccttgegt 


6540 


y ^-oy a u y v_, y d 


atapaacaaa 


aacpnt t tat 


caa a t pa at" a 


ataaarraoa 

Cl C V-J Ci CL VwJ CI 


cgacaggcgt


6600 


i_ L-ddyydcyy 


ay a luoa l. 


ci L.du u y u a 


day y ay d i_ > — 


□ a l_ u. Lyvj wUQ 




6660 


atr^trrtt^^ 




r~*aatn;"}tnnt 


nt^s rrra nna 


proaoaaata 

^•^yciycicici lu 


caattttatt 

v»» u y c c \- v. y c \- 


6720 


Gdddydyd l. v_- 


<^ay k^y uu^y v. 


i^aciaa^ua l. 


nnrt^tcaflfl 

y y v_» i_ d L^aoa 


atrraaatca 


t* t craacrccac 

v. tyyuy v^ay 


6780 


ntaaatn^tt 


atatftPtaa 


«\»a uyja tyyy 


aorroarrtaa 
c»yv»-^yciy u d d 


t"t"naat"t"crac 

c. lyocn. Ly ak. 


ccrcratcttna 


6840 


v.- „yydc*cici.i^ 


^aa uyat -aa 


add u y V— ci y a 


y y dd L. i_ l ql a 


uy u uy aL.y l l 


^rrtnrnnPr 

y ^uv.y^yyciV' 


6900 


acoaat acaa 

CI <- CI CI V-^ CI u 


ccaacgcaaa 


aataattttt 


aataatggca 


gcgcccaagt


gcccggtcag


6960 


CI C4 1* 1^ u VJ l** V> ^ 


crctt t era 1 1 a 

yv^vuv.yoi»^u 


catoctaaat 


aatttatata 


ataactcaaa 


cttaagcggt


7020 


tctrt" trccc 


a t t n a era ara 

d ay yy l-u 


GL VJ VJ Vw I- O NJ Q v^m 


oaact tacca 

^ *>* V_-» V_ ^ *w V-» 


aaagaaacca 


tcaatgatgg


7080 


tttctttttt 


ottcataaat 


cagacaaaac 


ttttctcttg 


caaaagtttg


tgaagtgttg


7140 


cacaatataa 


atgtgaaata 


cttcacaaac 


aaaaagacat 


caaagagaaa 


cataccctgc 


7200 


aaggatgctg


atattgtctg 


catttgcgcc 


ggagcaaacc 


aaaaacctgg 


tgagacacgc


7260 


cttgaattag 


tagaaaagaa 


cttgaagatt 


ttcaaaggca 


tcgttagtga 


agtcatggcg 


7320 


agcggatttg


acggcatttt


cttagtcggg


cggcacctcg 


ctaacggatt


caccactcca 


7380 


agaattggag


ccaatcaatt 


cttgcggaga


actgtgaatg 


cgcaaaccaa 


cccttggcag 


7440 


aacatatcca 


tcgcgtccgc 


catctccagc 


agccgcacgc 


ggcgcatctc 


gggcagcgtt 


7500 


gggtcctggc 


cacgggtgcg 


catgatcgtg


ctcctgtcgt 


tgaggacccg 


gctaggctgg 


7560 


cggggttgcc 


ttactggtta 


gcagaatgaa 


tcaccgatac 


gcgagcgaac


gtgaagcgac


7620 


tgctgctgca 


aaacgtctgc 


gacctgagca 


acaacatgaa 


tggtcttcgg


tttccgtgtt 


7680 


tcgtaaagtc 


tggaaacgcg 


gaagtcagcg 


ccctgcacca 


ttatgttccg 


gatctgcatc


7740 


gcaggatgct 


gctggctacc 


ctgtggaaca 


cctacatctg 


tattaacgaa 


gcgctggcat 


7800 
tgaccctgag


tgatttttct 


ctggtcccgc 


cgcatccata 


ccgccagttg 


tttaccctca 


7860 


caacgttcca 


gtaaccgggc 


atgttcatca 


tcagtaaccc 


gtatcgtgag 


catcctctct 


7920 


cgtttcatcg 


gtatcattac 


ccccatgaac 


agaaatcccc 


cttacacgga


ggcatcagtg 


7980 


accaaacagg 


aaaaaaccgc 


ccttaacatg 


gcccgcttta 


tcagaagcca


gacattaacg 


8040 


cttctggaga 


aactcaacga 


gctggacgcg 


gatgaacagg 


cagacatctg 


tgaatcgctt


8100 


cacgaccacg 


ctgatgagct 


ttaccgcagc 


tgcctcgcgc 


gtttcggtga 


tgacggtgaa 


8160 


aacctctgac 


acatgcagct 


cccggagacg 


gtcacagctt 


gtctgtaagc 


ggatgccggg 


8220 


agcagacaag 


cccgtcaggg


cgcgtcagcg


ggtgttggcg 



ggtgtcgggg 


cgcagccatg 


8280 


acccagtcac 


gtagcgatag 


cggagtgtat



actggcttaa 


ctatgcggca


tcagagcaga 


8340 


ttgtactgag 


agtgcaccat 


atgcggtgtg


aaataccgca


cagatgcgta


aggagaaaat 


8400 


accgcatcag 


gcgctcttcc


gcttcctcgc


tcactgactc 


gctgcgctcg


gtcgttcggc



8460 


tgcggcgagc 


ggtatcagct 


cactcaaagg


cggtaatacg


gttatccaca 


gaatcagggg


8520 


ataacgcagg 


aaagaacatg 


tgagcaaaag 


gccagcaaaa


ggccaggaac



cgtaaaaagg 


8580 


ccgcgttgct 


ggcgtttttc 


cataggctcc 


gcccccctga 


cgagcatcac 


aaaaatcgac 


8640 


gctcaagtca 


gaggtggcga


aacccgacag 


gactataaag 


ataccaggcg 


tttccccctg 


8700 


gaagctccct 


cgtgcgctct 


cctgttccga 


ccctgccgct 


taccggatac


ctgtccgcct 


8760


ttctcccttc 


gggaagcgtg



gcgctttctc 


atagctcacg 


ctgtaggtat 


ctcagttcgg


8820 


tgtaggtcgt


tcgctccaag 


ctgggctgtg


tgcacgaacc 


ccccgttcag 


cccgaccgct 


8880 


gcgccttatc 


cggtaactat


cgtcttgagt 


ccaacccggt 


aagacacgac 


ttatcgccac 


8940 


tggcagcagc 


cactggtaac 


aggattagca 


gagcgaggta


tgtaggcggt 


gctacagagt 


9000 


tcttgaagtg 


gtggcctaac 


tacggctaca 


ctagaaggac 


agtatttggt 


atctgcgctc 


9060 


tgctgaagcc 


agttaccttc 


ggaaaaagag 


ttggtagctc 


ttgatccggc


aaacaaacca 


9120 


ccgctggtag 


cggtggtttt 


tttgtttgca 


agcagcagat 


tacgcgcaga


aaaaaaggat 


9180 


ctcaagaaga 



tcctttgatc 


ttttctacgg


ggtctgacgc


tcagtggaac 


gaaaactcac 




gttaagggat 


tttggtcatg 


agattatcaa 


aaaggatctt 


cacctagatc 


cttttaaatt 


9300 


aaaaatgaag 


ttttaaatca 


atctaaagta 


tatatgagta 


aacttggtct 


gacagttacc 


9360 


aatgcttaat 


cagtgaggca 


cctatctcag 


cgatctgtct 


atttcgttca


tccatagttg 


9420 


cctgactccc 


cgtcgtgtag 


ataactacga 


tacgggaggg 


cttaccatct 


ggccccagtg 


9480 


ctgcaatgat 


accgcgagac 


ccacgctcac 


cggctccaga 


tttatcagca 


ataaaccagc 


9540 
cagccggaag


ggccgagcgc 


agaagtggtc 


ctgcaacttt 


atccgcctcc 


atccagtcta 


9600 


ttaattgttg 


ccgggaagct 


agagtaagta 


gttcgccagt 


taatagtttg 


cgcaacgttg


9660 


ttgccattgc 


tgcaggcatc 


gtggtgtcac



gctcgtcgtt 


tggtatggct


tcattcagct 


9720 


ccggttccca 


acgatcaagg 


cgagttacat 


gatcccccat 


gttgtgcaaa 


aaagcggtta 


9780 


gctccttcgg 


tcctccgatc 


gttgtcagaa 


gtaagttggc 


cgcagtgtta 


tcactcatgg


9840


ttatggcagc 


actgcataat 


tctcttactg


tcatgccatc


cgtaagatgc 


ttttctgtaa


9900 


ctggtgagta 


ctcaaccaag


tcattctgag 


aatagtgtat 


vj w vj v-A v»» 


aottactctt 


9960 


gcccggcatc


aacacgggat 


aataccgcgc 


cacatagcag 


aactttaaaa


atactcatca


10020 


ttggaaaacg 


ttcttcaaaa 


caaaaactct 


caancrat ct t 


accactatto 


a a at ccant t 


10080 


cgatgtaacc 


cactcgtgca 


cccaactgat 


cttcagcatc


ttttactttc 


accagcgttt 


10140 


ctgggtgagc 


aaaaacagga 


aggcaaaatg 


ccgcaaaaaa 


gggaataagg 


gcgacacgga 


10200 


aatgttgaat 


actcatactc 


ttcctttttc 


aatattattg 


aagcatttat 


cagggttatt 


10260 


gtctcatgag 


cggatacata 


tttgaatgta 


tttagaaaaa 


taaacaaata 


ggggttccgc 


10320 


gcacatttcc 


ccgaaaagtg 


ccacctgacg 


tctaagaaac 


cattattatc 


atgacattaa 


10380 


cctataaaaa 


taggcgtatc 


acgaggccct


ttcgtcttca


agaatt 




10426 



<210> 83 
<211> 4191 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN263 plasmid 

<400> 83 

ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag cccggatccg 60 

aaagaagagg atatcccggt tttacagaag gcattggatg atccaaaggt gtccatcaga 120 

agacaggctg ttgtgtactt aggaatgatt gaaacacctg atgttcttcc tctattgtat 180 

aaagcacttg aggacaaagc tgtatcagtc agaagaacgg ccggagactg cctgtctgat 240 

atcggcgatc ctcaagccat tcctgctatg atcaagtcat taagcgactc cagcaagctt 300 

gttcgctggc gtgccgccat gttcctgtac gaagtcggcg atgaaagtgc aattgaagct 360 

ttgcgcgctg ccgaagatga ccccgaattt gaggtcagcc ttcaagtcaa aatggcgctt 420 

gaacgtattg agcatggaga agaagcaaaa ggttctgttt ggaaacaaat gacggaaagc 480 
agaaaaaaag


gcgaataaag 


ataaaaaagg 


tgcagatcat 


gcaccttttt 


tatgtgaatt 


540 


ggtcgaccga 


aacagcagtt 


ataaggcatg 


aagctgtccg 


gtttttgcaa 


aagtggctgt 


600 


gactgtaaaa 


agaaatcgaa 


aaagaccgtt 


ttgtgtgaaa 


acggtctttt 


tgtttccttt 


660 


taaccaactg 


ccataactcg 


aggcctacct 


agcttccaag 


aaagatatcc 


taacagcaca 


720 


agagcggaaa 


gatgttttgt 


tctacatcca 


gaacaacctc 


tgctaaaatt 


cctgaaaaat 


780 


tttgcaaaaa 


gttgttgact 


ttatctacaa 


ggtgtggtat 


aataatctta 


acaacagcag 


840 


gacgctctag 


aggaggagac 


accatggcag 


aattacgcag 


taatatgatc 


acacaaggaa 


900 


tcgatagagc 


tccgcaccgc 


agtttgcttc 


gtgcagcagg 


ggtaaaagaa 


gaggatttcg 


960 


gcaagccgtt 


tattgcggtg 


tgtaattcat 


acattgatat 


cgttcccggt 


catgttcact 


1020 


tgcaggagtt 


tgggaaaatc 


gtaaaagaag 


caatcagaga 


agcagggggc 


gttccgtttg 


1080 


aatttaatac 


cattggggta 


gatgatggca 


tcgcaatggg 


gcatatcggt 


atgagatatt 


1140 


cgctgccaag 


ccgtgaaatt 


atcgcagact 


ctgtggaaac 


ggttgtatcc 


gcacactggt 


1200 


ttgacggaat 


ggtctgtatt 


ccgaactgcg 


acaaaatcac 


accgggaatg 


cttatggcgg 


1260 


caatgcgcat 


caacattccg 


acgatttttg 


tcagcggcgg 


accgatggcg 


gcaggaagaa 


1320 


caagttacgg 


gcgaaaaatc 


tccctttcct 


cagtattcga 


aggggtaggc 


gcctaccaag 


1380 


cagggaaaat 


caacgaaaac 


gagcttcaag 


aactagagca 


gttcggatgc 


ccaacgtgcg 


1440 


ggtcttgctc


aggcatgttt 


acggcgaact 


caatgaactg 


tctgtcagaa 


gcacttggtc 


1500 


ttgctttgcc 


gggtaatgga 


accattctgg 


caacatctcc 


ggaacgcaaa 


gagtttgtga 


1560 


gaaaatcggc 


tgcgcaatta 


atggaaacga 


ttcgcaaaga 


tatcaaaccg 


cgtgatattg 


1620 


ttacagtaaa 


agcgattgat 


aacgcgtttg 


cactcgatat 


ggcgctcgga 


ggttctacaa 


1680 


ataccgttct 


tcataccctt 


gcccttgcaa 


acgaagccgg 


cgttgaatac 


tctttagaac 


1740 


gcattaacga 


agtcgctgag 


cgcgtgccgc 


acttggctaa 


gctggcgcct 


gcatcggatg 


1800 


tgtttattga 


agatcttcac 


gaagcgggcg 


gcgtttcagc 


ggctctgaat 


gagctttcga 


1860 


agaaagaagg 


agcgcttcat 


ttagatgcgc 


tgactgttac 


aggaaaaact 


cttggagaaa 


1920 


ccattgccgg 


acatgaagta 


aaggattatg 


acgtcattca 


cccgctggat 


caaccattca 


1980 


ctgaaaaggg 


aggccttgct 


gttttattcg 


gtaatctagc 


tccggacggc 


gctatcatta 


2040 


aaacaggcgg 


cgtacagaat 


gggattacaa 


gacacgaagg 


gccggctgtc 


gtattcgatt 


2100 


ctcaggacga 


ggcgcttgac 


ggcattatca 


accgaaaagt 


aaaagaaggc 


gacgttgtca 


2160 


tcatcagata 


cgaagggcca 


aaaggcggac 


ctggcatgcc 


ggaaatgctg 


gcgccaacat 


2220 
cccaaatcgt tggaatggga ctcgggccaa aagtggcatt gattacggac ggacgttttt 2280

ccggagcctc ccgtggcctc tcaatcggcc acgtatcacc tgaggccgct gagggcgggc 2340

cgcttgcctt tgttgaaaac ggagaccata ttatcgttga tattgaaaaa cgcatcttgg 2400

atgtacaagt gccagaagaa gagtgggaaa aacgaaaagc gaactggaaa ggttttgaac 2460

cgaaagtgaa aaccggctac ctggcacgtt attctaaact tgtgacaagt gccaacaccg 2520

gcggtattat gaaaatctag acccctggcg taatagcgaa gaggcccgca ccgatcgccc 2580

ttcccaacag ttgcgcagcc tgaatggcga atgagcttgc gccgtcccgt caagtcagcg 2640

taatgctctg ccagtgttac aaccaattaa ccaattctga ttagaaaaac tcatcgagca 2700

tcaaatgaaa ctgcaattta ttcatatcag gattatcaat accatatttt tgaaaaagcc 2760

gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca agatcctggt 2820

atcggtctgc gattccgact cgtccaacat caatacaacc tattaatttc ccctcgtcaa 2880

aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggt gagaatggca 2940

aaaggttatg catttctttc cagacttgtt caacaggcca gccattacgc tcgtcatcaa 3000

aatcactcgc atcaaccaaa ccgttattca ttcgtgattg cgcctgagcg agacgaaata 3060

cgcgatcgct gttaaaagga caattacaaa caggaatcga atgcaaccgg cgcaggaaca 3120

ctgccagcgc atcaacaata ttttcacctg aatcaggata ttcttctaat acctggaatg 3180

ctgttttccc agggatcgca gtggtgagta accatgcatc atcaggagta cggataaaat 3240

gcttgatggt cggaagaggc ataaattccg tcagccagtt tagtctgacc atctcatctg 3300

taacatcatt ggcaacgcta cctttgccat gtttcagaaa caactctggc gcatcgggct 3360

tcccatacaa tcaatagatt gtcgcacctg attgcccgac attatcgcga gcccatttat 3420

acccatataa atcagcatcc atgttggaat ttaatcgcgg cctcgacgag caagacgttt 3480

cccgttgaat atggctcata acaccccttg tattactgtt tatgtaagca gacagtttta 3540

ttgttcatga tgatatattt ttatcttgtg caatgtaaca tcagagattt tgagacactc 3600

gacaagatga tcttcttgag atcgttttgg tctgcgcgta atctcttgct ctgaaaacga 3660

aaaaaccgcc ttgcagggcg gtttttcgaa ggttctctga gctaccaact ctttgaaccg 3720

aggtaactgg cttggaggag cgcagtcacc aaaacttgtc ctttcagttt agccttaacc 3780

ggcgcatgac ttcaagacta actcctctaa atcaattacc agtggctgct gccagtggtg 3840

cttttgcatg tctttccggg ttggactcaa gacgatagtt accggataag gcgcagcggt 3900

cggactgaac ggggggttcg tgcatacagt ccagcttgga gcgaactgcc tacccggaac 3960

tgagtgtcag gcgtggaatg agacaaacgc ggccataaca gcggaatgac accggtaaac 4020

cgaaaggcag gaacaggaga gcgcacgagg gagccgccag gggaaacgcc tggtatcttt 4080 

atagtcctgt cgggtttcgc caccactgat ttgagcgtca gatttcgtga tgcttgtcag 4140 

gggggcggag cctatggaaa aacggctttg ccgcggccct ctcacttccc t 4191 



<210> 84 

<211> 702 

<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(699)

<400> 84

ttg tta ctg gtt atc gat gtg ggg aac acc aat act gta ctt ggt gta 48
Met Leu Leu Val Ile Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val
1 5 10 15

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca agc agg 96
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg Ile Glu Thr Ser Arg
20 25 30

cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tcc tta ttt gat 144
His Lys Thr Glu Asp Glu Phe Gly Met Ile Leu Arg Ser Leu Phe Asp
35 40 45

cac tcc ggg ctt atg ttt gaa cag ata gat ggc att att att tcg tca 192
His Ser Gly Leu Met Phe Glu Gln Ile Asp Gly Ile Ile Ile Ser Ser
50 55 60

gta gtg ccg cca atc atg ttt gcg tta gaa aga atg tgc aca aaa tac 240
Val Val Pro Pro Ile Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr
65 70 75 80

ttt cat atc gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 288
Phe His Ile Glu Pro Gln Ile Val Gly Pro Gly Met Lys Thr Gly Leu
85 90 95

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga atc gta 336
Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile Val
100 105 110

aat gct gtc gct gcg ata cac ttg tac ggc aat cca tta att gtt gtc 384
Asn Ala Val Ala Ala Ile His Leu Tyr Gly Asn Pro Leu Ile Val Val
115 120 125

gat ttc gga acc gcc aca acg tac tgc tat att gat gaa aac aaa caa 432
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr Ile Asp Glu Asn Lys Gln
130 135 140

tac atg ggc ggg gcg att gcc cct ggg att aca att tcg aca gag gcg 480
Tyr Met Gly Gly Ala Ile Ala Pro Gly Ile Thr Ile Ser Thr Glu Ala
145 150 155 160

ctt tac tcg cgt gca gca aag ctt cct cgt atc gaa atc acc cgg ccc 528
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg Ile Glu Ile Thr Arg Pro
165 170 175

gac aat att atc gga aaa aac act gtt agc gcg atg caa tct gga att 576
Asp Asn Ile Ile Gly Lys Asn Thr Val Ser Ala Met Gln Ser Gly Ile
180 185 190

tta ttt ggc tat gtc ggc caa gtg gaa gga atc gtt aag cga atg aaa 624
Leu Phe Gly Tyr Val Gly Gln Val Glu Gly Ile Val Lys Arg Met Lys
195 200 205

tgg cag gca aaa cag gac cca agg tca ttg cga cag gag gcc tgg cgc 672
Trp Gln Ala Lys Gln Asp Pro Arg Ser Leu Arg Gln Glu Ala Trp Arg
210 215 220

cgc tca ttg cga acg aat cag att gta tag 702
Arg Ser Leu Arg Thr Asn Gln Ile Val
225 230



<210> 85 
<211> 233 
<212> PRT 

<213> Bacillus subtilis 
<400> 85 

Met Leu Leu Val Ile Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val
1 5 10 15

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg Ile Glu Thr Ser Arg
20 25 30

His Lys Thr Glu Asp Glu Phe Gly Met Ile Leu Arg Ser Leu Phe Asp
35 40 45

His Ser Gly Leu Met Phe Glu Gln Ile Asp Gly Ile Ile Ile Ser Ser
50 55 60

Val Val Pro Pro Ile Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr
65 70 75 80

Phe His Ile Glu Pro Gln Ile Val Gly Pro Gly Met Lys Thr Gly Leu
85 90 95

Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile Val
100 105 110

Asn Ala Val Ala Ala Ile His Leu Tyr Gly Asn Pro Leu Ile Val Val
115 120 125

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr Ile Asp Glu Asn Lys Gln
130 135 140

Tyr Met Gly Gly Ala Ile Ala Pro Gly Ile Thr Ile Ser Thr Glu Ala
145 150 155 160

Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg Ile Glu Ile Thr Arg Pro
165 170 175

Asp Asn Ile Ile Gly Lys Asn Thr Val Ser Ala Met Gln Ser Gly Ile
180 185 190

Leu Phe Gly Tyr Val Gly Gln Val Glu Gly Ile Val Lys Arg Met Lys
195 200 205

Trp Gln Ala Lys Gln Asp Pro Arg Ser Leu Arg Gln Glu Ala Trp Arg
210 215 220

Arg Ser Leu Arg Thr Asn Gln Ile Val
225 230

<210> 86 
<211> 1623 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(1620)

<400> 86

atg tat ttg gca ttc cag gtg caa aaa ttg atg cgg tat ttg acg ctt 48
Met Tyr Leu Ala Phe Gln Val Gln Lys Leu Met Arg Tyr Leu Thr Leu
1 5 10 15

tac aag ata aag gac ctg aaa tta tcg ttg ccc ggc acg aac aaa acg 96
Tyr Lys Ile Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr
20 25 30

cag caa ttc atg gcc caa gca gtc ggc cgt tta act gga aaa ccg gga 144
Gln Gln Phe Met Ala Gln Ala Val Gly Arg Leu Thr Gly Lys Pro Gly
35 40 45

gtc gtg tta gtc aca tca gga ccg ggt gcc tct aac ttg gca aca ggc 192
Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly
50 55 60

ctg ctg aca gcg aac act gaa gga gac cct gtc gtt gcg ctt gct gga 240
Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly
65 70 75 80

aac gtg atc cgt gca tat cgt tta aaa cgg aca cat caa tct ttg gat 288
Asn Val Ile Arg Ala Tyr Arg Leu Lys Arg Thr His Gln Ser Leu Asp
85 90 95

aat gcg gcg cta ttc cag ccg att aca aaa tac agt gta gaa gtt caa 336
Asn Ala Ala Leu Phe Gln Pro Ile Thr Lys Tyr Ser Val Glu Val Gln
100 105 110

gat gta aaa aat ata ccg gaa gct gtt aca aat gca ttt agg ata gcg 384
Asp Val Lys Asn Ile Pro Glu Ala Val Thr Asn Ala Phe Arg Ile Ala
115 120 125

tca gca ggg cag gct ggg gcc gct ttt gtg agc ttt ccg caa gat gtt 432
Ser Ala Gly Gln Ala Gly Ala Ala Phe Val Ser Phe Pro Gln Asp Val
130 135 140

gtg aat gaa gtc aca aat acg aaa aac gtg cgt gct gtt gca gcg cca 480
Val Asn Glu Val Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro
145 150 155 160

aaa ctc ggt cct gca gca gat gat gca atc agt gcg gcc ata gca aaa 528
Lys Leu Gly Pro Ala Ala Asp Asp Ala Ile Ser Ala Ala Ile Ala Lys
165 170 175

atc caa aca gca aaa ctt cct gtc gtt ttg gtc ggc atg aaa ggc gga 576
Ile Gln Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly
180 185 190

aga ccg gaa gca att aaa gcg gtt cgc aag ctt ttg aaa aag gtt cag 624
Arg Pro Glu Ala Ile Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gln
195 200 205

ctt cca ttt gtt gaa aca tat caa gct gcc ggt acc ctt tct aga gat 672
Leu Pro Phe Val Glu Thr Tyr Gln Ala Ala Gly Thr Leu Ser Arg Asp
210 215 220

tta gag gat caa tat ttt ggc cgt atc ggt ttg ttc cgc aac cag cct 720
Leu Glu Asp Gln Tyr Phe Gly Arg Ile Gly Leu Phe Arg Asn Gln Pro
225 230 235 240

ggc gat tta ctg cta gag cag gca gat gtt gtt ctg acg atc ggc tat 768
Gly Asp Leu Leu Leu Glu Gln Ala Asp Val Val Leu Thr Ile Gly Tyr
245 250 255

gac ccg att gaa tat gat ccg aaa ttc tgg aat atc aat gga gac cgg 816
Asp Pro Ile Glu Tyr Asp Pro Lys Phe Trp Asn Ile Asn Gly Asp Arg
260 265 270

aca att atc cat tta gac gag att atc gct gac att gat cat gct tac 864
Thr Ile Ile His Leu Asp Glu Ile Ile Ala Asp Ile Asp His Ala Tyr
275 280 285

cag cct gat ctt gaa ttg atc ggt gac att ccg tcc acg atc aat cat 912
Gln Pro Asp Leu Glu Leu Ile Gly Asp Ile Pro Ser Thr Ile Asn His
290 295 300

atc gaa cac gat gct gtg aaa gtg gaa ttt gca gag cgt gag cag aaa 960
Ile Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gln Lys
305 310 315 320

atc ctt tct gat tta aaa caa tat atg cat gaa ggt gag cag gtg cct 1008
Ile Leu Ser Asp Leu Lys Gln Tyr Met His Glu Gly Glu Gln Val Pro
325 330 335

gca gat tgg aaa tca gac aga gcg cac cct ctt gaa atc gtt aaa gag 1056
Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu Ile Val Lys Glu
340 345 350

ttg cgt aat gca gtc gat gat cat gtt aca gta act tgc gat atc ggt 1104
Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp Ile Gly
355 360 365

tcg cac tcc att tgg atg tca cgt tat ttc cgc agc tac gag ccg tta 1152
Ser His Ser Ile Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu
370 375 380

aca tta atg atc agt aac ggt atg caa aca ctc ggc gtt gcg ctt cct 1200
Thr Leu Met Ile Ser Asn Gly Met Gln Thr Leu Gly Val Ala Leu Pro
385 390 395 400

tgg gca atc ggc gct tca ttg gtg aaa ccg gga gaa aaa gtg gtt tct 1248
Trp Ala Ile Gly Ala Ser Leu Val Lys Pro Gly Glu Lys Val Val Ser
405 410 415

gtc tct ggt gac ggc ggt ttc tta ttc tca gca atg gaa tta gag aca 1296
Val Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu Thr
420 425 430

gca gtt cga cta aaa gca cca att gta cac att gta tgg aac gac agc 1344
Ala Val Arg Leu Lys Ala Pro Ile Val His Ile Val Trp Asn Asp Ser
435 440 445

aca tat gac atg gtg cat ttc cag caa ttg aaa aaa tat aac cgt aca 1392
Thr Tyr Asp Met Val His Phe Gln Gln Leu Lys Lys Tyr Asn Arg Thr
450 455 460

tct gcg gtc gat ttc gga aat atc gat atc gtg aaa tat gcg gaa agc 1440
Ser Ala Val Asp Phe Gly Asn Ile Asp Ile Val Lys Tyr Ala Glu Ser
465 470 475 480

ttc gga gca act gcg ttg cgc gta gaa tca cca gac cag ctg gca gat 1488
Phe Gly Ala Thr Ala Leu Arg Val Glu Ser Pro Asp Gln Leu Ala Asp
485 490 495

gtt ctg cgt caa ggc atg aac gct gaa ggt cct gtc atc atc gat gtc 1536
Val Leu Arg Gln Gly Met Asn Ala Glu Gly Pro Val Ile Ile Asp Val
500 505 510

ccg gtt gac tac agt gat aac att aat tta gca agt gac aag ctt ccg 1584
Pro Val Asp Tyr Ser Asp Asn Ile Asn Leu Ala Ser Asp Lys Leu Pro
515 520 525

aaa gaa ttc ggg gaa ctc atg aaa acg aaa gct ctc tag 1623
Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu
530 535 540



<210> 87 
<211> 540 
<212> PRT 

<213> Bacillus subtilis 
<400> 87 

Met Tyr Leu Ala Phe Gln Val Gln Lys Leu Met Arg Tyr Leu Thr Leu
1 5 10 15

Tyr Lys Ile Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr
20 25 30

Gln Gln Phe Met Ala Gln Ala Val Gly Arg Leu Thr Gly Lys Pro Gly
35 40 45

Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly
50 55 60

Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly
65 70 75 80

Asn Val Ile Arg Ala Tyr Arg Leu Lys Arg Thr His Gln Ser Leu Asp
85 90 95

Asn Ala Ala Leu Phe Gln Pro Ile Thr Lys Tyr Ser Val Glu Val Gln
100 105 110

Asp Val Lys Asn Ile Pro Glu Ala Val Thr Asn Ala Phe Arg Ile Ala
115 120 125

Ser Ala Gly Gln Ala Gly Ala Ala Phe Val Ser Phe Pro Gln Asp Val
130 135 140

Val Asn Glu Val Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro
145 150 155 160

Lys Leu Gly Pro Ala Ala Asp Asp Ala Ile Ser Ala Ala Ile Ala Lys
165 170 175

Ile Gln Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly
180 185 190

Arg Pro Glu Ala Ile Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gln
195 200 205

Leu Pro Phe Val Glu Thr Tyr Gln Ala Ala Gly Thr Leu Ser Arg Asp
210 215 220

Leu Glu Asp Gln Tyr Phe Gly Arg Ile Gly Leu Phe Arg Asn Gln Pro
225 230 235 240

Gly Asp Leu Leu Leu Glu Gln Ala Asp Val Val Leu Thr Ile Gly Tyr
245 250 255

Asp Pro Ile Glu Tyr Asp Pro Lys Phe Trp Asn Ile Asn Gly Asp Arg
260 265 270

Thr Ile Ile His Leu Asp Glu Ile Ile Ala Asp Ile Asp His Ala Tyr
275 280 285

Gln Pro Asp Leu Glu Leu Ile Gly Asp Ile Pro Ser Thr Ile Asn His
290 295 300

Ile Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gln Lys
305 310 315 320

Ile Leu Ser Asp Leu Lys Gln Tyr Met His Glu Gly Glu Gln Val Pro
325 330 335

Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu Ile Val Lys Glu
340 345 350

Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp Ile Gly
355 360 365

Ser His Ser Ile Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu
370 375 380

Thr Leu Met Ile Ser Asn Gly Met Gln Thr Leu Gly Val Ala Leu Pro
385 390 395 400

Trp Ala Ile Gly Ala Ser Leu Val Lys Pro Gly Glu Lys Val Val Ser
405 410 415

Val Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu Thr
420 425 430

Ala Val Arg Leu Lys Ala Pro Ile Val His Ile Val Trp Asn Asp Ser
435 440 445

Thr Tyr Asp Met Val His Phe Gln Gln Leu Lys Lys Tyr Asn Arg Thr
450 455 460

Ser Ala Val Asp Phe Gly Asn Ile Asp Ile Val Lys Tyr Ala Glu Ser
465 470 475 480

Phe Gly Ala Thr Ala Leu Arg Val Glu Ser Pro Asp Gln Leu Ala Asp
485 490 495

Val Leu Arg Gln Gly Met Asn Ala Glu Gly Pro Val Ile Ile Asp Val
500 505 510

Pro Val Asp Tyr Ser Asp Asn Ile Asn Leu Ala Ser Asp Lys Leu Pro
515 520 525

Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu
530 535 540



<210> 88 
<211> 23 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<220> 

<223> All occurrences of n indicate any nucleotide 
<400> 88 

agaaaggagg tgannnnnnn atg 23 



<210> 89 
<211> 7 
<212> PRT 
<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

<400> 89 

Ile Arg Glu Met Glu Arg Ile
1 5 



<210> 90 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

<400> 90 

Ile Arg Glu Arg Arg
1 5 



<210> 91 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

<400> 91 

Ile Arg Arg Lys Glu Val Asn
1 5 

<210> 92 
<211> 6688 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN336 plasmid 

<400> 92 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 

tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 

ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 

ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgcacc aggcttctca 240 

ggcgctgact tagaaaacct cttgaatgaa gctgcgcttg tagcggctcg tcaaaacaag 300 
aaaaaaatcg atgcgcgtga tattgacgaa gcgacggacc gtgtaattgc cggacccgct 360

aagaagagcc gcgttatctc caagaaagaa cgcaatatcg tggcttatca cgaaggcgga 420

cacaccgtta tcggtctcgt tttagatgag gcagatatgg ttcataaagt aacgattgtt 480

cctcggggcc aggctggcgg ttatgctgtt atgctgccaa gagaagaccg ttatttccaa 540

acaaagccgg agctgcttga taaaattgtc ggcctcttgg gcggacgtgt tgctgaagag 600

attatcttcg gtgaagtcag cacaggggcg cacaatgact tccagcgtgc gacgaatatt 660

gcaagacgaa tggttacaga attcggtatg tcagaaaaac tgggaccgtt gcaatttgga 720

cagtctcagg gcggtcaggt attcttaggc cgtgatttca acaacgaaca gaactacagt 780

gatcaaatcg cttacgaaat tgatcaggaa attcagcgca tcatcaaaga atgttatgag 840

cgtgcgaaac aaatcctgac tgaaaatcgt gacaagcttg aattgattgc ccaaacgctt 900

ctgaaagttg aaacgcttga cgctgaacaa atcaaacacc ttatcgatca tggaacatta 960

cctgagcgta atttctcaga tgatgaaaag aacgatgatg tgaaagtaaa cattctgaca 1020

aaaacagaag aaaagaaaga cgatacgaaa gagtaattcg ctttctttct aaaaaaactg 1080

ccggctgacg ctggcagttt ttttatgtaa atgattggct cagctgcggc ttttacaatc 1140

atccaattct ggtatcgatt tgtttacaaa tgagccgctg atcgtytatg gtattgtaga 1200

atgtttgtaa aaagtaaagt agagaaacta ttcaaaagtg gtgatagagg ttgttactgg 1260

ttatcgatgt ggggaacacc ctgcagctcg agtgaaatac cgcacagatg cgtaaggaga 1320

aaataccgca tcaggcgata aacccagcga accatttgag gtgataggta agattatacc 1380

gaggtatgaa aacgagaatt ggacctttac agaattactc tatgaagcgc catatttaaa 1440

aagctaccaa gacgaagagg atgaagagga tgaggaggca gattgccttg aatatattga 1500

caatactgat aagataatat atcttttata tagaagatat cgccgtatgt aaggatttca 1560

gggggcaagg cataggcagc gcgcttatca atatatctat agaatgggca aagcataaaa 1620

acttgcatgg actaatgctt gaaacccagg acaataacct tatagcttgt aaattctatc 1680

ataattgtgg tttcaaaatc ggctccgtcg atactatgtt atacgccaac tttcaaaaca 1740

actttgaaaa agctgttttc tggtatttaa ggttttagaa tgcaaggaac agtgaattgg 1800

agttcgtctt gttataatta gcttcttggg gtatctttaa atactgtaga aaagaggaag 1860

gaaataataa atggctaaaa tgagaatatc accggaattg aaaaaactga tcgaaaaata 1920

ccgctgcgta aaagatacgg aaggaatgtc tcctgctaag gtatataagc tggtgggaga 1980

aaatgaaaac ctatatttaa aaatgacgga cagccggtat aaagggacca cctatgatgt 2040

ggaacgggaa aaggacatga tgctatggct ggaaggaaag ctgcctgttc caaaggtcct 2100

gcactttgaa cggcatgatg gctggagcaa tctgctcatg agtgaggccg atggcgtcct 2160

ttgctcggaa gagtatgaag atgaacaaag ccctgaaaag attatcgagc tgtatgcgga 2220

gtgcatcagg ctctttcact ccatcgacat atcggattgt ccctatacga atagcttaga 2280

cagccgctta gccgaattgg attacttact gaataacgat ctggccgatg tggattgcga 2340

aaactgggaa gaagacactc catttaaaga tccgcgcgag ctgtatgatt ttttaaagac 2400

ggaaaagccc gaagaggaac ttgtcttttc ccacggcgac ctgggagaca gcaacatctt 2460

tgtgaaagat ggcaaagtaa gtggctttat tgatcttggg agaagcggca gggcggacaa 2520

gtggtatgac attgccttct gcgtccggtc gatcagggag gatatcgggg aagaacagta 2580

tgtcgagcta ttttttgact tactggggat caagcctgat tgggagaaaa taaaatatta 2640

tattttactg gatgaattgt tttagtacct agatttagat gtctaaaaag ctttaactac 2700

aagcttttta gacatctaat cttttctgaa gtacatccgc aactgtccat actctgatgt 2760

tttatatctt ttctaaaagt tcgctagata ggggtcccga gcgcctacga ggaatttgta 2820

tcgccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc ggtcgactgg 2880

caggcaaaac aggacccaag gtcattgcga caggaggcct ggcgccgctc attgcgaacg 2940

aatcagattg tatagacatc gttgatccat tcttaaccct aaaagggctg gaattgattt 3000

atgaa3gaaa ccgcgtagga agtgtatagg aggtttagta atggattatt tagtaaaagc 3060

acttgcgtat gacggaaaag ttcgggctta tgcagcgaga acgactgata tggtaaatga 3120

ggggcagaga cgccatggta cgtggccgac agcatccgct gcactaggcc gtacaatgac 3180

agcttcactt atgctcggcg ctatgctgaa gggcgatgat aagctgaccg tgaaaatcga 3240

gggcggaggt ccgatcggag ctattgtagc tgatgccaat gccaaaggag aagtcagagc 3300

ctatgtctct aacccgcaag ttcattttga tttaaatgaa caaggtaagc ttgatgtcag 3360

acgtgcggtt ggaacaaacg gaacgttaag tgtcgtaaaa gatttaggtt tgcgcgagtt 3420

cttcacagga caagtagaaa tcgtttcagg agaattagga gatgatttta cttactatct 3480

tgtgtcatct gagcaggttc cttcatcagt gggcgtaggt gtgctcgtaa atcctgacaa 3540

taccattctt gcggcagggg gctttattat tcagctgatg ccgggaacag atgatgaaac 3600

aatcacaaaa attgaacagc gtctatctca agtagagccg atttctaagc tcatccaaaa 3660

agggctgaca ccagaagaaa ttttagaaga agtcctaggc gagaaacctg agattttgga 3720

aacgatgcct gtcagattcc attgcccttg ttcaaaagaa cggttcgaaa cagccatttt 3780

aggactaggc aaaaaagaaa ttcaagatat gatagaagaa gatggacaag ccgaagcagt 3840

atgccatttt tgtaatgaaa agtacttatt tacaaaagaa gagctggaag ggcttcgtga 3900

ccaaactacc cgctaagctc tttagcgggt ttttaatttg agaaaagggg ctgaaagcag 3960

gtttgaaatc aagaacaatc tggacgcgtt ggatgcatag cttgagtatt ctatagtgtc 4020

acctaaatag cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc 4080

tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg ggtgcctaat 4140

gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc 4200

tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 4260

ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 4320

cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 4380

gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 4440

tggcgttttt cgataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 4500

agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 4560

tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 4620

cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 4680

ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 4740

ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 4800

ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 4860

ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc 4920

cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 4980

gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 5040

atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 5100

ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 5160

gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa 5220

tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc 5280

ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga 5340

taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa 5400

gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt 5460

gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttggcattg 5520

ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc 5580

aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg 5640

gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag 5700

cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt 5760

actcaaccaa gtcattctga gaataccgcg cccggcgacc gagttgctct tgcccggcgt 5820

caatacggga taatagtgta tgacatagca gaactttaaa agtgctcatc attggaaaac 5880

gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac 5940

ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag 6000

caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 6060

tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga 6120

gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc 6180

cccgaaaagt gccacctgta tgcggtgtga aataccgcac agatgcgtaa ggagaaaata 6240

ccgcatcagg cgaaattgta aacgttaata ttttgttaaa attcgcgtta aatatttgtt 6300

aaatcagctc attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag 6360

aatagaccga gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga 6420

acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg 6480

aaccatcacc caaatcaagt tttttgcggt cgaggtgccg taaagctcta aatcggaacc 6540

ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg 6600

aagggaagaa agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc 6660

gcgtaaccac cacacccgcc gcgcttaa 6688



<210> 93 
<211> 8503 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN004 plasmid 

<400> 93 

gaattttgcg gccgcttcga aagctgtaat ataaaaacct tcttcaacta acggggcagg 60 

ttagtgacat tagaaaaccg actgtaaaaa gtacagtcgg cattatctca tattataaaa 120 

gccagtcatt aggcctatct gacaattcct gaatagagtt cataaacaat cctgcatgat 180 

aaccatcaca aacagaatga tgtacctgta aagatagcgg taaatatatt gaattacctt 240 
tattaatgaa ttttcctgct gtaataatgg gtagaaggta attactatta ttattgatat 300

ttaagttaaa cccagtaaat gaagtccatg gaataataga aagagaaaaa gcattttcag 360

gtataggtgt tttgggaaac aatttccccg aaccattata tttctctaca tcagaaaggt 420


ataaatcata 


aaactctttg 


aagtcattct 


t tacaggagt 


ccaaatacca 


aacraatat 1 1 


480 


tagat acacc 


atcaaaaat t 


otataaaata 


gctctaactt 


atcccaa taa 


rrtaartrtr 


540 


catcactatt 


otaarraatt - 

W I— Ca CI v^* cji \j l l 




tattroaiit t 




y LLai^ i_ciciyci 




aaataaatgc 


agggtaaaat 


ttatatcctt 


cttattttat 


atttcacrtat" 

y i_ v«> t a v. 




660 


ta tcaatt t c 


tataattata 


ctaaaaatca 


tttattaatt 


caaataatcra 




720 


cttttctctt 


ccaattgtct 


aaatcaatt t 


ta 1 1 aaaott 


catttoatat 

* ci c l u y ca c* c-i c* 


cfcr*t"f H f N taaa 


780 


tttttatcta 


aagtgaattt 


aoaaaactta 




t tcttcatta 


aaat^aafrr 

y UU L ^UQ L 


840 


ttttttaaaa 

L L L L L L U U, U. U 


n1~r , aa1~a1~t"3 




aata f atatt 


tfaaaaafat* 

i_ Laadfla L d (_ 


cccactttat 


900 


w N-^ u u \# L L V«p \A 


ttt"nt"t"craap 


•J- 3 a +- rrrrrrt" rtc 


t t"t"arrt*Toi4fl 
Lavji. u. vj o d 




cacattaaaa 


960 


aatgtggtct 


tttatatttt 


ttt*aaaCTaa+* 


t^aanrrrtaci 




ttttctttct 


1020 


tatct taa ta 


ataaaaataa 

<-A ^* C U ^ L CJ C4 


ctattaaatt 




a ci 1 1" 1" »it' a na 


aacgcaaaaa 


1080 


ggccatccgt 


caggatggcc 


ttctgcttaa 


1 1 1 gatgcct 


aacaattta*" 

\J \£ U U- V_- l_ w 


ggcgggcgtc 


1140 


ctgcccgcca 


ccctccgggc 


cgttgcttcg 


caacgt t caa 


atccgctccc 


ggcqgatttq; 1200 


tcctactcag 


gagagcgt tc 


accgacaaac 


aacaqataaa 


a cgaa 3ggcc 


cagtctttcg^ 1260 


actgagcctt 


tcgttttatt 


tgatgcctgg 


cagttccct a 


ctctcgcat g 


gggagacccc 


1320 


acactaccat 


cqqcqctacq 


gcgtttcact 


tctqaqt.tcq 


qcatqqqqt c 


aggtgggacc 


1380 


accgcgctac 


tqccqccaqq 


caaattctgt 


1 1 tatcagac 


cgcttctgcg 


ttctgattta 


1440 


atctgtatca 


ggctgaaaat 


cttctctcat 


ccgccaaaac 


aggatcctac 


ggaaatggag 


1500 


cggcaaaacc 


gttttactct 


caaaatctta 


aaagaaaacc 


cccgataaag 


ggggcttttc 


1560 


ttctacaaaa 


ttgtacgggc 


tggttcgttc 


cccagcattt 


gttcaatttt 


gttttgatca 


1620 


ttcagaacag 


ccactttcgg 


ctcatggctt 


gccgcttctt 


gatcagacat 


cattttgtag 


1680 


gaaataataa 


tgaccttatc 


tccttcctgc 


acaaggcgtg 


cggctgcacc 


gtttaagcat 


1740 


atgacgccgc 


ttccccgttt 


accaggaata 


atatacgttt 


caagacgtgc 


tccattatta 


1800 


ttattcacaa 


tttgtacttt 


ttcattagga 


agcattccca 


cagcatcaat 


gagatcttca 


1860 


tcaattgtaa 


tgcttcccac 


atagttcagg 


tttgcttccg 


taacagttgc 


cctgtgaagt 


1920 


ttgccgctca 


tcattgttcg 


atacatatta 


tattctctcc 


atttctcgaa 


tatcaataat 


1980 
gatattatct attaaacgcg cttttgaaaa
agcaatttca ttcacaggct cgagttccgg 
tccgctagtc gtttcaatga tatcttttgc 
accggcttgg acaagttccg cacttgtttg 
tctttcctca gctgttaagt atacattgcg 
gaccgtatcg acaggaacca attcaatatc 
aacagctacc tgctgcgcat cttttaaacc 
gaaaagcttc gtcagtacga tcgcgacccc 
taacacgtct gtgcgtcttt ctacatgaat 
atgagcatct ggcgtaaaaa gaatatcgac 
ctcaatatcg cgcggatatg cttcaaaatc 
aaaaatactc ataataacgg cgtcgttttc 
cccctcatgc agaaacccca tcgtcggaac 
ttgttttatg gcttctttca gctgtgaaat 
cgtacaagcc gtcaagcact gtctggttca 
cacgatgtct tacatcctga acatatccgc 
gcgtatattg ctttacaaat ttaggtgttc 
aaacgagaac ttgtccgtcc gctttcacac 
gcgtctcggc aattttggct gtgagttctg 
ctcctgcttc ttcgcatttt atactgtctt 
tgccctgtac tttatagccg cccagtacgc 
tgactactgg aatgcctcca agcgtcaatg 
cctcaagctt cagtgcgtca gctccgcttt 
tatcttcctt agacaggtga taagacataa 
gcgcacccct tttaacggct tttgtatgat 
ttgaatcaag gccgaggacg accattccaa 
ccgcttgttc agcaagttta gctgccggat 
cttcagactc cttcattttt agaaaatcca 
gagcgtcctg ctgttgttaa gattattata 
agcaactgca


acagcgagaa 


tcatctttcc 


2040 


ataggaataa 


agctctacat 


agtctatggt 


2100 


agcttttatc 


accgcttcag 


gatctctttc 


2160 


aaqqqcccqa 


tacagcttag 


gegcttet tt 


2220 


agagct t t tg 


get aagccgt 


cttcctct ct 


2280 


catgaagaag 


tcqctaat ta 


accca tcaac 


2340 


aaaataoaca 

V* W t^A V* 


caaatcoact 


t era ct: a era 1 1 


2400 


gt caaaatgt 


cct t ctcttg 


aacoccraca 

t-i y w ^ w ^ ^ \^ LA 


2460 


catoarattc 


tfttraccaa 




2520 


I- V*. >J ^ l_ l_ 




i_ y ci ulv» 


2580 


u uv^au y y y 








ttatcttact 


i_ y t u t y l. u u 


ayy L. u d d d u y 


2700 


aaatcraatt 


aartt"ocrct 


etcraatnefta 

V— * L_ W Q U LUU 


2760 




*- y LL.H.yaLL.L 


l a l i_ t_ i_ i_. 1.- ^ v_. 


2820 


t t tgaaagga 


atgcttttgt 


t caaocraaaci 


2880 


tgattgctgt 


ttcgatggtt


tcatcaatgc


2940 


tctcaacacc


gtggccgata 


atatcatgat


3000 


cagccccgat 


tccaatgacc


ggtatgctta


3060 


ccggcacaca 


ttccagcaca


agcatcatag


3120 


ctattaattt 


tttggcgctt 


tgttcgtctt 


3180 


cgactgactg 


cggtgtcaaa 


cctaagtgac 


3240 


cgcgaatgga


ttcaaacacg 


ccttctccgc 


3300 


cctgaacgat 


agccgctgca 


tttttcagcg 


3360 


acggcatatc 


tgtcacaata 


aaggtattcg 


3420 


ggatcatgtc 


cgcaactgtc 


acaccgacag 


3480 


gtgaatcacc 


gactaaaatc 


atgtcaactc 


3540 


aatcataagc 


ggtcagcatg 


acaatcggtt


3600 


gttttgtttt 


catgttttct 


cctccttcta 


3660 


ccacaccttg 


tagataaagt 


caacaacttt 


3720 
ttgcaaaatt


tttcaggaat 


tttagcagag 


gttgttctgg


atgtagaaca 


aaacatcttt 


3780 


ccgctcttgt 


gctgttagga 


tatctttctt 


ggaagctagg


taggcctcga


gttatggcag


3840 


ttggttaaaa 


ggaaacaaaa 


agaccgtttt 


cacacaaaac 


ggtctttttc 


gatttctttt 


3900 


tacagtcaca 


gccacttttg


caaaaaccgg


acagcttcat


gccttataac


tgctgtttcg 


3960 


gtcgacaagc 


ttcgcgaagc


ggccgcaaaa 


ttcactggcc 


gtcgttttac


aacatcotoa 

v** vh* y w y l y w* 


4020 


ctaoaaaaac 


cctaacatta 

W *W y y *w y ^— L Si 


cccaact t aa 


t caccttaca 


acacat cccc 


ctttcoccao 

\^ w w L v-^ \y x^. s^, ci y 


4080 


ct* aacat* aa t 
333 \-ac* i— 


aaccfaaoaaa 


"L-» W *w %J W W ► VmI d 


tcarrrttrc 


raacacfttac 

l» a d y l uy v> 




4140 


t ocicicra a t o a 


ccfcctoatac 

> — ^— • l y ci l y 


aatattttct 


ccttarorat 

^■^ t v>civ«y Vi>a 


ctatacaata 


tttcacacca 


4200 


catataatac 


actrtcaata 


caat ct get c 


taataccaca 


taattaaacc 


aoccccoaca 

y \^ \w y u \— t-* 


4260 




rrrrrrt" a;** r*t 

^ 1 — ( — y v, i_ y a \_« i— 


3tac*ttat^*i^ 


cfat t - 1" t~ at 


y aaoaoo l u w 


<- Lcaaaci load 


4320




t" ft" a net cit" c c 

l l. l a y y y l l- 


rr'^^tt^^tt 




t*nt~^t"t"^^Aa 


nt patf raaa 

y i_ ^ a i_ Li^aaa 


4380






tt^ata^^a*"* 


\-»v-» v.uy v. u a y a 


L. u v. too ly i^y 


a^it"at"1"ar^a3 
ya Ly l Ly uy a 


4440




r^f- t fTf*a 
aautciL i_ y l. y 


at^ar'^i.aaa^i 


aaayuuayLL. 


t ttcatoata 


LdLL-LvLvaa 


4500


r t tat* at "nn 


aftt^-t^t^ta 

y L, L *_ CI w I. Q Llj 


a;** r'aft't'sspri 

^av^uoL taaa 


aaLaaLaaaci 


iirarrart t a a 

^ > — ay — i_ l y a 


rfraat" ^at t* 

< — y a Lay l l 


4560


r crrirt at* aao 


f**a at tat at" a 


r^tt^ataf-i^t 


rt'^prnrt' t a 

v» v.an V/y)^. i, l, y 


ant taaarrcr 


Ly vvy vy aay 


4620


coacatcacfc 

w ^-j y L* y y y 


ttaaacoaat 


hattaaacat 

L y L U. w y Q-i v.- 


tatttaccaa 

L d L L L y S-^ *w y u 


ctaccttggt 


oatctcacct 

y l l- l y \^ 


4680


ttcacotaat 

V- "w "W^ V W W 


aoacaaattc 


ttccaactga


z ctgcgcgcg 


aggccaagcg


atcttcttct


4740


tgtccaagat 


aagcctgtct 


agcttcaagt


atgacgggct


gatactgggc 


cggcaggcgc 


4800 


tccattgccc 


agtcggcagc


gacatccttc 


ggcgcgattt


tgccggttac 


tgcgctgtac 


4860 


caaatgcggg


acaacgtaag


cactacattt 


cgctcatcgc 


cagcccagtc 


gggcggcgag


4920 


ttccatagcg 


ttaaggtttc 


atttagcgcc


tcaaatagat 


cctgttcagg 


aaccggatca


4980 


aagagttcct 


ccgccgctgg 



acctaccaag 


gcaacgctat


gttctcttgc 


ttttgtcagc 


5040 


aagatagcca


gatcaatgtc 


gatcgtggct 



ggctcgaaga 


tacctgcaag


aatgtcattg 


5100 


cgctgccatt 


ctccaaattg 


cagttcgcgc 


ttagctggat 


aacgccacgg 


aatgatgtcg 


5160 


tcgtgcacaa 


caatggtgac 


ttctacagcg 


cggagaatct


cgctctctcc 


aggggaagcc


5220 


gaagtttcca 


aaaggtcgtt 


gatcaaagct 


cgccgcgttg 


tttcatcaag 


ccttacggtc 


5280 


accgtaacca 


gcaaatcaat 


atcactgtgt 


ggcttcaggc 


cgccatccac 


tgcggagccg 


5340 


tacaaatgta 


cggccagcaa 


cgtcggttcg 


agatggcgct


cgatgacgcc 


aactacctct 


5400 


gatagttgag 


tcgatacttc 


ggcgatcacc 


gcttccctca 


tgatgtttaa 


ctttgtttta 


5460 
aoacaactac


cctgctgcgt 


aacatcgttg 


gcgtaacgcg 


cttgctgctt


GGatfJCCCCJa 

y V- v_J ^ w yj c* 


taacaagcca 


tgaaaaccgc 


cactgcgccg 


craccaafctac 


ataaacacat 

*--4 w Ol ~J *w l_ 


acactactto 




i" t* rnt erect 1" 




ayv—ayv^yciciy 


Ltyayyua l l. 


t" ci~ nf~ ret' tin 
uy LLLiyy 




ay y La u tyyu 


yyut LiyLiy 


r* t" <"T /""* fT* 1 ^ /"T r"t i"> 


t. tCaygaya C 




accccggacg 


aagtggttcg


catcctcggt


tcty t, l it ty u 


aLyyaatyyy 


Ldugcggacc 


p4*rtrrai"+* +- <^«- 

ctygaL ticy 


atCaCggCaC 


gdLCatCyLg 


LtydtgLtaC 


CCydyayCtt 


ggcacccdgc 


■j-(~irj/->/-.+-f-4-' u /-i 

Lyattv-L l y 


aaiyaCCLtl 


aatayaLLal 




L t L Laaaad L 


LtUL LCdCrdU 


C Ct t~ 51 J3 f PT 


yytuy iituy 


y ig l LyuLay 


T^Tt" t" f T I""**"' (Iff 

*-J VJ L. ^- U M ' — . \— 


f» t~ c\ z\ z$ n n 




y uy l LaL Lyy 


/~> f- <tt tr /~r +- +- 


fit* /"T f"T P" .3 t" 


acctotctat 


rr t" nt" rra r~i~ cii~ 

y ^ y l y a l u <— 


Lyayuty laa 


ctaattactt 


tattttacta 




ttacattgtc 


gatctgttca


taataaacaa 


ctgatgtatc


tatctttttt 


acaccatttt 


atatctaacg 


gtgaacagtt


gttctacttt 


atacaagagc 


cataagaacc 


tcagatcctt 


gttcgttgtt 


tttgcgtgag 


ccatgagaac 


tcactcaaaa 


attttgcctc


aaaactggtg 


agtgtttttc 


ttagtccgtt 


acgtaggtag


tcaccattca 


tttttatctg 


gttgttctca 


agttcaactt 


ggaaaatcaa 


cgtatcagtc 


atattgctgt 


aagtgtttaa 


atctttactt 



ctgctccata acatcaaaca tcgacccacg 5520 
ggcatagact gtaccccaaa aaaacagtca 5580 
ttaccaccgc tgcgttcggt caaggttctg 5640 
cattacagct tacgaaccga acaggcttat 5700 
cacggtgtgc gtcacccggc aaccttgggc 5760 
ctggcgaacg agcgcaaggt ttcggtctcc 5820
ttcttctacg gcaaggtgct gtgcacggat 5880
cggccgtcgc ggcgcttgcc ggtggtgctg 5940
tttctggaag gcgagcatcg tttgttcgcc 6000
agtgagggtt tgcaactgcg ggtcaaggat 6060
cgggagggca agggctccaa ggatcgggcc 6120
ctgcgcgagc aggggaattg atccggtgga 6180
attactaatt aattggggac cctagaggtc 6240
aacggtttac aagcataacg ggttttgctg 6300
tttgttatca gaatcgcaga tccggcttca 6360
agaattgcca tgattttttc cccacgggag 6420
ttgattcgat aagcagcatc gcctgtttca 6480
caagttgtct caggtgttca atttcatgtt 6540
ttctattagg tgttacatgc tgttcatctg 6600
ctttaaatgc accaaaaact cgtaaaagct 6660
catctgtgca tatggacagt tttccctttg 6720
tgtttgttag tcttgatgct tcactgatag 6780
ccgtatttag ccagtatgtt ctctagtgtg 6840
gaaccattga gatcatgctt actttgcatg 6900
agctgaattt ttgcagttaa agcatcgtgt 6960
gaatctgatg taatggttgt tggtattttg 7020
agttcggtta cgagatccat ttgtctatct 7080
gggcggcctc gcttatcaac caccaatttc 7140
attggtttca aaacccattg gttaagcctt 7200
t" taaarhrat

L L 


aataattatt 


ttcaaocatt 


ou^u l \j a a l» l 


l a a a l LL>d ll 


ddyyLLdd ll 




tctatat* t t-o 


ccttataaat" 

l L L ^ c* L 


tttcttttat 


at taat t* rt t 

y L Lay LLULL 


L Lao Laautd 


LLLdLdddLt 


' O c. u 


u iw- d tay dy l 


atttottttr 

aLLLyLLLLL* 


uaaay au t La 


Qua L Lj L LLLod 


aat-1- at- afff 
ydLLdLdLLL 


LdtyddLLLL 


1 ^RH 
/ jDU 


l l LaauLyya 


a a a y a l day y 


L»dd LaLLLLL 


LtdLLddddd 


CLddL LCLdd 


LLLLLCgCLL 


i a a n 

/ *i ft u 


y a y a a o l u gg 


/-i -j +- a fr 4" 4~ 4" <T 4~ 

UdLdyLLLy L 


L,L.dLLyyddd 


a LC LCdddyC 


y— « 4V 4> 4> 4 ^ /— . y— » — , —| 

cl l l aaccaa 


— v y^r i — ff — v 4- 4— y-, 4W y-W 

agga L ucc eg 


•7 c A 
/ jUU 


d L U LtLdUdy 


^4* 4~ /-^ /■» 4- i"i a 

itctcgccdc 


CayCuClCLy 


gLLgcLLuag 


CLaatacacc 


ataagcattt 


/ joU 




4— 4- 4* f~~* 3 +~ 4— 

tgr.Lcat.CdL 


CtyayCy L.aL 


ugg l ta taag 


tgaacgatac


cgtccgttct


Ton 


L f.CC l tg Lay 


ggLLLLCdaL 


/t 4- /*r /■*■ /"y Ac 4* 4" /**t 

cgtggggttg 


agtagtgcca 


cacagcataa


aattagcttg


*7 CO rt 

/ bo U 




ccgLLday lC 


a Lagcgaciia 


atcgctagtt


-,-.4-4-4-~ < -,4-f-4- 

catttgcttt 


gaaaacaact 


1 1 H U 


aattcagaca 


tacatctcaa 


ttggtctagg 


i- / __4_+-4-4-— ,-»4- 

tgattttaat


cactatacca 


attgagatgg 


T O rt rt 

7800 


gctayucaau 


gataattact


agtccttttc


ctttgagttg


tgggtatctg 


taaattctgc 


7860


tagacctttg


ctggaaaact 


J- —4- _ - - 4. f_ _ 4- 

tgtaaatcct 


gctagaccct 


4- y^4- ™ — . — . 4- 4- A 

ctgtaaattc


cgctagacct 


7920


4-4-/~i4-/'»4-i-r4-4-4- 

rLC"T,C, ? ty LtL 


4-4-4- 4-4.-4- 4- f — 


tattcaagtg


~4-4--,4--,— .4-4-4- 

gtta taat 1 1. 


atagaataaa


gaaagaataa 


7980


daaridyd'-da 


aaagaataga 


tcccagccct 


gtgtataact


cactacttta


gtcagttccg


8040


--- 'j i- f- ^ -i 
^dyr,dLCdCa 


aaaggatgtc


gcaaacgctg 


t*t tgct cctc 


tacaaaacag 


accttaaaac


8100


LLLddd(jU(.,L 


LdaCjLdyCdC 


- y— * 4— mm --\ ^ ft f* 

ccl cgcaagc 


tcgggcaaat 


cgctgaatat 


4-_,~,4-4-4-4-— 4-— . 

l CCl l l l g l c 


8160


tccgaccatc 


aggcacctga 


gtcgotgtct 


ttttcgtgac 


attcagttcg 


ctgcgctcac 


8220 


ggctctggca 


gtgaatgggg 


gtaaatggca 


ctacaggcgc 


cttttatgga 


ttcatgcaag 


8280 


gaaactaccc 


ataatacaag 


aaaagcccgt 


cacgggcttc 


tcagggcgtt 


ttatggcggg 


8340 


tctgctatgt 


ggtgctatct 


gactttttgc 


tgttcagcag 


ttcctgccct 


ctgattttcc 


8400 


agtctgacca 


cttcggatta 


tcccgtgaca 


ggtcattcag 


actggctaat


gcacccagta 


8460 


aggcagcggt 


atcatcaaca 


ggcttacccg 


tcttactgtc 


aac 




8503 



<210> 94 
<211> 7381 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: recombinant pAN006 plasmid

<400> 94 

ttgcggccgc ttcgaaagct gtaatataaa aaccttcttc aactaaeggg gcaggttagt 60 



gacattagaa


aaccgactgt 


aaaaagtaca 


gtcggcatta 


tctcatatta 


taaaagccag 


120 


tcattaggcc 


tatctgacaa 


ttcctgaata 


gagttcataa 


acaatcctgc 


atgataacca 


180 


tcacaaacag 


aatgatgtac 


ctgtaaagat 


agcggtaaat 


atattgaatt 


acctttatta 


240 


atgaattttc 


ctgctgtaat 


aatgggtaga 


aggtaattac 


tattattatt 


gatatttaag 


300 


ttaaacccag 


taaatgaagt 


ccatggaata 


atagaaagag 


aaaaagcatt 


ttcaggtata 


360 


ggtgttttgg 


gaaacaattt 


ccccgaacca 


ttatatttct 


ctacatcaga 


aaggtataaa 


420 


tcataaaact 


ctttgaagtc 


attctttaca 


ggagtccaaa 


taccagagaa 


tgttttagat 


480 


acaccatcaa 


aaattgtata 


aagtggctct 


aacttatccc 


aataacctaa 


ctctccgtcg 


540 


ctattgtaac 


cagttctaaa 


agctgtattt 


gagtttatca 


cccttgtcac 


taagaaaata 


600 


aatgcagggt 


aaaatttata 


tccttcttgt 


tttatgtttc 


ggtataaaac 


actaatatca 


660 


atttctgtgg 


ttatactaaa 


agtcgtttgt 


tggttcaaat 


aatgattaaa 


tatctctttt 


720 


ctcttccaat 


tgtctaaatc 


aattttatta 


aagttcattt 


gatatgcctc 


ctaaattttt 


780 


atctaaagtg 


aatttaggag 


gcttact bgt 


ctgctttctt 


cattagaatc 


aatccttttt 


840 


taaaagtcaa 


tattactgta 


acataaatat 


atattttaaa 


aatatcccac 


tttatccaat 


900 


tttcgtttgt 


tgaactaatg 


ggtgctttag 


ttgaagaata 


aagaccacat 


taaaaaatgt 


960 


ggtcttttgt 


gtttttttaa 


aggatttgag 


cgtagcgaaa 


aatccttttc 


tttcttatct 


1020 


tgataataag 


ggtaactatt 


gaattcggta 


ccaagagttt 


gtagaaacgc 


aaaaaggcca 


1080 


tccgtcagga 


tggccttctg 


cttaatttga 


tgcctggcag 


tttatggcgg 


gcgtcctgcc 


1140 


cgccaccctc 


cgggccgttg 


cttcgcaacg 


ttcaaatccg 


ctcccggcgg 


atttgtccta 


1200 


ctcaggagag 


cgttcaccga 


caaacaacag 


ataaaacgaa 


aggcccagtc 


tttcgactga 


1260 


gcctttcgtt 


ttatttgatg 


cctggcagtt 


ccctactctc 


gcatggggag 


accccacact 


1320 


accatcggcg 


ctacggcgtt 


tcacttctga 


gttcggcatg 


gggtcaggtg 


ggaccaccgc 


1380 


gctactgccg 


ccaggcaaat 


tctgttttat 


cagaccgctt 


ctgcgttctg 


atttaatctg


1440 


tatcaggctg 


aaaatcttct 


ctcatccgcc 


aaaacaggat 


cctacggaaa 


tggagcggca 


1500 


aaaccgtttt 


actctcaaaa 


tcttaaaaga 


aaacccccga 


taaagggggc 


ttttcttcta 


1560 


caaaattgta 


cgggctggtt 


cgttccccag 


catttgttca 


attttgtttt 


gatcattcag 


1620 


aacagccact 


ttcggctcat 


ggcttgccgc 


ttcttgatca 


gacatcattt 


tgtaggaaat 


1680 


aataatgacc 


ttatctcctt 


cctgcacaag 


gcgtgcggct 


gcaccgttta 


agcatatgac 


1740 


gccgcttccc 


cgtttaccag 


gaataatata 


cgtttcaaga 


cgtgctccat 


tattattatt 


1800 
cacaatttgt actttttcat taggaagcat
tgtaatgctt cccacatagt tcaggtttgc 
gctcatcatt gttcgataca tattatattc 
tatctattaa acgcgctttt gaaaaagcaa 
tttcattcac aggctcgagt tccggatagg 
tagtcgtttc aatgatatct tttgcagctt 
cttggacaag ttccgcactt gtttgaaggg 
cctcagctgt taagtataca ttgcgagagc 
tatcgacagg aaccaattca atatccatga 
ctacctgctg cgcatctttt aaaccgaaat 
gcttcgtcag tacgatcgcg accccgtcaa 
cgtctgtgcg tctttctaca tgaatcgtga 
catctggcgt aaaaagaata tcgactccgg
tatcgcgcgg atatgcttca aaatcttcat 
tactcataat aacggcgtcg ttttcttgtc 
catgcagaaa ccccatcgtc ggaacaaatc 
ttatggcttc tttcagctgt gaaatatcag 
aagccgtcaa gcactgtctg gttcatttga 
tgtcttacat cctgaacata tccgctgatt 
tattgcttta caaatttagg tgttctctca 
agaacttgtc cgtccgcttt cacaccagcc 
tcggcaattt tggctgtgag ttctgccggc 
gcttcttcgc attttatact gtcttctatt 
tgtactttat agccgcccag tacgccgact 
actggaatgc ctccaagcgt caatgcgcga 
agcttcagtg cgtcagctcc gctttcctga 
tccttagaca ggtgataaga cataaacggc 
ccccttttaa cggcttttgt atgatggatc 
tcaaggccga ggacgaccat tccaagtgaa 
tcccacagca


tcaatgagat 


cttcatcaat 


1860 


ttccgtaaca 


gttgccctgt 


gaagtttgcc 


1920 


tctccatttc 


tcgaatatca 


ataatgatat 


1980 


ctgcaacagc 


gagaatcatc 


tttccagcaa 


2040 


aataaagctc 


tacatagtct 


atggttccgc 


2100 


ttatcaccgc 


ttcaggatct 


ctttcaccgg 


2160 


cccgatacag 


cttaggcgct 


tcttttcttt 


2220 


ttttggctaa 


gccgtcttcc 


tctctgaccg 


2280 


agaagtcgct 


gattaaccca 


tcaacaacag 


2340 


aggcacgagt 


cggcttgact 


agattgaaaa 


2400 


aatgtccttc 


tcttgagcgc 


ccgcataaca 


2460 


cattcttttc 


accgggatac 


atatcatgag 


2520 


cgttttctgc 


aagagctgca 


tcccgctcaa 


2580 


tagggccgaa 


ttgtgcagga 


t loacaaaaa' 


2640 


ttgctttgtc


tgctaaggtt 


aaatgcccct 


2700 


cgattgactt 


gccctctgaa 


tggtattgtt


2760 


taatctgtct 


catcttattt 


tcccccgtac 


2820 


aaggaatgct 


tttgttcagg 


gaaagcacga 


2880 


gctgtttcga 


tggtttcatc 


aatgcgcgta 


2940 


acaccgtggc 


cgataatatc 


atgataaacg 


3000 


ccgattccaa 


tgaccggtat 


gcttagcgtc 


3060 


acacattcca 


gcacaagcat 


catagctcct 


3120 


aattttttgg 


cgctttgttc 


gtctttgccc 


3180 


gactgcggtg 


tcaaacctaa 


gtgactgact 


3240 


atggattcaa 


acacgccttc 


tccgccctca 


3300 


acgatagccg 


ctgcattttt 


cagcgtatct 


3360 


atatctgtca 


caataaaggt 


attcggcgca 


3420 


atgtccgcaa 


ctgtcacacc gacagttgaa 


3480 


tcaccgacta 


aaatcatgtc 


aactcccgct 


3540 
tgttcagcaa


gtttagctgc 


cggataatca 


taagcggtca 


gcatgacaat 


cggttcttca 


3600 


gactccttca 


tttttagaaa 


atccagtttt 


gttttcatgt 


tttctcctcc 


tctagagcgt 


3660 


cctgctgttg 


ttaagattat 


tataccacac 


cttgtagata 


aagtcaacaa 


ctttttgcaa 


3720 


aatttttcag 


gaattttagc 


agaggttgtt 


ctggatgtag 


aacaaaacat 


ctttccgctc 


3780 


ttgtgctgtt 


aggatatctt 


tcttggaagc 


taggtaggcc 


tcgagttatg 


gcagttggtt 


3840 


aaaaggaaac 


aaaaagaccg 


ttttcacaca 


aaacggtctt 


tttcgatttc 


tttttacagt 


3900 


cacagccact 


tttgcaaaaa 


ccggacagct 


tcatgcctta 


taactgctgt 


ttcggtcgac 


3960 


ctgcaggcat 


gcaagcttcg 


cgaagcggcc 


gccgacgcga 


ggctggatgg 


ccttccccat 


4020 


tatgattctt 


ctcgcttccg 


gcggcatcgg 


gatgcccgcg 


ttgcaggcca 


tgctgtccag 


4080 


gcaggtagat 


gacgaccatc 


agggacagct 


tcaaggatcg 


ctcgcggctc 


ttaccagcct 


4140 


aacttcgatc 


actggaccgc 


tgatcgtcac 


ggcgatttat 


gccgcctcgg 


cgagcacatg 


4200 


gaacgggttg 


gcatggattg 


taggcgccgc 


cctatacctt 


gtctgcctcc 


ccgcgttgcg 


4260 


tcgcggtgca 


tggagccggg 


ccacctcgac 


ctgaatggaa 


gccggcggca 


cctcgctaac 


4320 


ggattcacca 


ctccaagaat 


tggagccaat 


caattcttgc


ggagaactgt 


gaatgcgcaa 


4380 


accaaccctt 


ggcagaacat 


atccatcgcg 


tccgccatct 


ccagcagccg 


cacgcggcgc 


4440


atctcgggca


gcgttgggtc 


ctggccacgg 


gtgcgcatga 


tcgtgctcct 


gtcgttgagg


4500


acccggctag 


gctggcgggg 


ttgccttact 


ggttagcaga 


atgaatcacc 


gatacgcgag 


4560 


cgaacgtgaa 


gcgactgctg 


ctgcaaaacg 


tctgcgacct 


gagcaacaac 


atgaatggtc 


4620


ttcggtttcc 


gtgtttcgta 


aagtctggaa 


acgcggaagt 


cagcgccctg 


caccattatg 


4680 


ttccggatct 


gcatcgcagg 


atgctgctgg 


ctaccctgtg 


gaacacctac 


atctgtatta 


4740 


acgaagcgct 


ggcattgacc 


ctgagtgatt 


tttctctggt 


cccgccgcat 


ccataccgcc 


4800 


agttgtttac 


cctcacaacg 


ttccagtaac 


cgggcatgtt 


catcatcagt 


aacccgtatc 


4860 


gtgagcatcc 


tctctcgttt 


catcggtatc 


attaccccca 


tgaacagaaa 


ttccccctta 


4920 


cacggaggca 


tcaagtgacc 


aaacaggaaa 


aaaccgccct 


taacatggcc 


cgctttatca 


4980


gaagccagac 


attaacgctt 


ctggagaaac 


tcaacgagct 


ggacgcggat 


gaacaggcag 


5040 


acatctgtga 


atcgcttcac 


gaccacgctg 


atgagcttta 


ccgcagctgc 


ctcgcgcgtt 


5100 


tcggtgatga 


cggtgaaaac 


ctctgacaca 


tgcagctccc 


ggagacggtc 


acagcttgtc 


5160 


tgtaagcgga 


tgccgggagc 


agacaagccc 


gtcagggcgc 


gtcagcgggt 


gttggcgggt 


5220 


gtcggggcgc 


agccatgacc 


cagtcacgta 


gcgatagcgg 


agtgtatact 


ggcttaacta 


5280 
tgcggcatca


gagcagattg 


tactgagagt 


gcaccatatg 


cggtgtgaaa 


taccgcacag 


5340 


atgcgtaagg 


agaaaatacc 


gcatcaggcg 


ctcttccgct 


tcctcgctca 


ctgactcgct 


5400 


gcgctcggtc 


gttcggctgc 


ggcgagcggt 


atcagctcac 


tcaaaggcgg 


taatacggtt 


5460 


atccacagaa 


tcaggggata 


acgcaggaaa 


gaacatgtga 


gcaaaaggcc 


agcaaaaggc 


5520 


caggaaccgt 


aaaaaggccg 


cgttgctggc 


gtttttccat 


aggctccgcc 


cccctgacga 


5580 


gcatcacaaa 


aatcgacgct 


caagtcagag 


gtggcgaaac 


ccgacaggac 


tataaagata 


5640 


ccaggcgttt 


ccccctggaa 


gctccctcgt 


gcgctctcct 


gttccgaccc 


tgccgcttac 


5700 


cggatacctg 


tccgcctttc 


tcccttcggg 


aagcgtggcg 


ctttctcata


gctcacgctg 


5760 


taggtatctc 


agttcggtgt 


aggtcgttcg 


ctccaagctg 


ggctgtgtgc 


acgaaccccc 


5820 


cgttcagccc 


gaccgctgcg 


ccttatccgg 


taactatcgt 


cttgagtcca 


acccggtaag 


5880 


acacgactta 


tcgccactgg 


cagcagccac 


tggtaacagg 


attagcagag 


cgaggtatgt 


5940 


aggcggtgct 


acagagttct 


tgaagtggtg


gcctaactac 


ggctacacta 


gaaggacagt 


6000 


atttggtatc


tgcgctctgc 


tgaagccagt 


taccttcgga 


aaaagagttg 


gtagctcttg


6060 


atccggcaaa


caaaccaccg 


ctggtagcgg 


tggttttttt 


gtttgcaagc 


agcagattac


6120 


gcgcagaaaa


aaaggatctc 


aagaagatcc 


tttgatcttt 


tctacggggt 


ctgacgctca 


6180 


gtggaacgaa 


aactcacgtt 


aagggatttt 


ggtcatgaga 


ttatcaaaaa 


ggatccccac


6240


ctagatcctt 


ttaaattaaa


aatgaagttt 


taaatcaatc 


taaagtatat 


atgagtaaac 


6300 


ttggtctgac 


agttaccaat 


gcttaatcag 


tgaggcacct 


atctcagcga 


tctgtctatt 


6360 


tcgttcatcc 


atagttgcct 


gactccccgt 


cgtgtagata 


actacgatac 


gggagggctt 


6420 


accatctggc 


cccagtgctg 


caatgatacc 


gcgagaccca 


cgctcaccgg 


ctccagattt 


6480 


atcagcaata 


aaccagccag 


ccggaagggc 


cgagcgcaga 


agtggtcctg 


caactttatc 


6540 


cgcctccatc 


cagtctatta 


attgttgccg 


ggaagctaga 


gtaagtagtt 


cgccagttaa 


6600 


tagtttgcgc 


aacgttgttg 


ccattgctgc 


aggcatcgtg 


gtgtcacgct 


cgtcgtttgg 


6660 


tatggcttca 


ttcagctccg 


gttcccaacg 


atcaaggcga 


gttacatgat 


cccccatgtt 


6720 


gtgcaaaaaa 


gcggttagct 


ccttcggtcc 


tccgatcgtt 


gtcagaagta 


agttggccgc 


6780 


agtgttatca 


ctcatggtta 


tggcagcact 


gcataattct 


cttactgtca 


tgccatccgt 


6840 


aagatgcttt 


tctgtgactg 


gtgagtactc 


aaccaagtca 


ttctgagaat 


agtgtatgcg 


6900 


gcgaccgagt 


tgctcttgcc 


cggcgtcaat 


acgggataat 


accgcgccac 


atagcagaac 


6960 


tttaaaagtg 


ctcatcattg 


gaaaacgttc 


ttcggggcga 


aaactctcaa 


ggatcttacc 


7020 
gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 7080
tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 7140
aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 7200
catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 7260
acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat 7320
tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc gtcttcaaga 7380
a 7381



